STEM CELL

The invention relates to a method to modulate the differentiation state of embryonic 
stem cells. 

During mammalian development those cells that form part of the embryo up until the
formation of the blastocyst are said to be totipotent (i.e. each cell has the
developmental potential to form a complete embryo and all the cells required to
support the growth and development of said embryo). During the formation of the
blastocyst, the cells that comprise the inner cell mass are said to be pluripotential
(i.e. each cell has the developmental potential to form a variety of tissues).

Embryonic stem cells (ES cells, those with pluripotentiality) may be principally
derived from two embryonic sources. Cells isolated from the inner cell mass are
termed embryonic stem (ES) cells. In the laboratory mouse, similar cells can be
derived from the culture of primordial germ cells isolated from the mesenteries or
genital ridges of days 8.5-12.5 post coitum embryos. These would ultimately
differentiate into germ cells and are referred to as embryonic germ cells (EG cells).
Each of these types of pluripotential cell has a similar developmental potential with
respect to differentiation into alternate cell types, but possible differences in
behaviour (eg with respect to imprinting) have led to these cells being distinguished
from one another. Hereinafter embryonic stem cells will encompass both these stem
cell types.

Typically ES cell cultures have well defined characteristics. These include, but are
not limited to: maintenance in culture for at least 20 passages when maintained on
fibroblast feeder layers; production of clusters of cells in culture referred to as
embryoid bodies; the ability to differentiate into multiple cell types in monolayer
culture; and expression of ES cell specific markers.

Until very recently, in vitro culture of human ES cells was not possible. The first
indication that conditions may be determined which could allow the establishment of
human ES cells in culture is described in WO96/22362. The application describes
cell lines and growth conditions which allow the continuous proliferation of primate
ES cells which exhibit a range of characteristics or markers which are associated with
stem cells having pluripotent characteristics.

More recently Thomson et al (1998) have published conditions in which human ES
cells can be established in culture. The above characteristics shown by primate ES
cells are also shown by the human ES cell lines. In addition the human cell lines
show high levels of telomerase activity, a characteristic of cells which have the
ability to divide continuously in culture in an undifferentiated state. Another group
(Reubinoff et al., 2000) have also reported the derivation of human ES cells from
human blastocysts. A third group (Shamblott et al., 1998) have described EG cell
derivation.

A feature of ES cells is that, in the presence of fibroblast feeder layers, they retain the
ability to divide in an undifferentiated state for several generations. If the feeder
layers are removed then the cells differentiate. The differentiation is often to
neurones or muscle cells but the exact mechanism by which this occurs and its
control remain unsolved. It would be desirable to have a reliable culture system
which does not require the presence of fibroblast feeder cells but includes the
addition of a factor(s) which maintains ES cells in an undifferentiated state. A
prerequisite to the successful exploitation of ES cells in tissue engineering is to
provide a reliable and defined cell culture system which can be used to control the
differentiation of ES cells into a selected cell-type. The identification of gene targets
involved in maintaining ES cells as ES cells and the identification of gene targets
involved in differentiation will facilitate this objective.

We have identified a regulatory pathway involved in the mechanism by which ES
cells are maintained as ES cells in culture and which also influences the
differentiation of said cells in culture. The regulatory pathway comprises two
families of genes referred to as Notch and Wnt.

The Notch gene is a Drosophila prototype for a family of homologues found in
diverse species, encoding large, single-span, transmembrane receptors (reviewed in
Weinmaster, 1997). Within the extracellular domain, located distally from the
transmembrane region, are found multiple (10-36), tandem arrays of epidermal
growth factor-like repeats (Wharton et al., 1985; Kopczynski et al., 1988). More
proximally are found 3 cysteine-rich, Lin-12/Notch repeats and two conserved
cysteine residues. The intracellular domain contains, from proximal to distal with
respect to the transmembrane region, a subtransmembrane region (STR), six ankyrin
repeats and a region rich in proline, glutamic acid, serine and threonine (PEST). The
generic Notch structure is illustrated in Figure 1.

Wnt genes encode diffusible, extracellular signalling molecules of around 350-400
amino acids in length, defined by a characteristic pattern of conserved cysteine
residues, along with other invariant amino acids (see
http://www.stanford.edu/~rnusse/wntwindow.html).

In the 1970s, the wingless (wg1) mutation of Drosophila melanogaster was
described, in which affected individuals showed aberrant wing and haltere
development (Sharma, 1973; Sharma and Chopra, 1976). When the gene disrupted by
this mutation was subsequently identified, the predicted 468aa peptide sequence
exhibited remarkable similarity to that of a murine gene, int-1 (Cabrera et al., 1987;
Rijsewijk et al., 1987), including an identical pattern of 23 conserved cysteine
residues. The int-1 gene had earlier been identified as a common integration site of
the murine mammary tumour virus, and a likely cellular oncogene (Nusse and Varmus,
1982; van Ooyen and Nusse, 1984). Thus, the two prototypic members of the Wnt gene
family were described. Since that time, numerous homologues of wingless/int-1 have
been identified in divergent organisms, including Caenorhabditis elegans,
Drosophila melanogaster, Xenopus laevis, chicken, mouse and humans (reviewed in
Cadigan and Nusse, 1997; Wodarz and Nusse, 1998). Lower organisms appear to
possess a limited repertoire of Wnt genes in comparison to higher organisms,
presumably reflecting their lesser developmental complexity. Additionally,
vertebrates appear to express multiple, closely related orthologues of certain Wnts.
The Wnt family is composed of more than 60 members, with 14 human homologues
alone. Well-documented roles exist for Wnt signalling in a variety of developmental
processes, including cell fate specification and patterning within the central nervous
system.

Wnt ligands interact with membrane-bound receptors of the frizzled family, leading
to activation of a cytoplasmic protein, Dishevelled. Dishevelled inhibits Notch
activation (2) and also inhibits the activity of an Axin-APC-GSK-3β complex,
promoting formation of a bipartite transcriptional activator comprising β-catenin and
TCF (4). Wnt signalling may be antagonised by extracellular molecules that compete
for Wnt binding, including frizzled related proteins (FRP), Wnt inhibitory factors
(WIF), Dickkopf and Cerberus. Expression of Wnt target genes may also be regulated
by other proteins that bind to and alter the function of TCF. CREB-Binding Protein
(CBP) exhibits a mutually antagonistic binding affinity for TCF with β-catenin and
converts TCF into a repressor of target genes (8). Additionally, Notch activation may
induce transcriptional repression by TCF, even in the presence of β-catenin, through
expression of the TLE class of putative target genes (5, 7).

As a model system to test the involvement of Notch and Wnt genes in the
differentiation of ES cells we have used embryonal carcinoma (EC) cells, which are
the stem cells of teratocarcinomas. The stem cells of early embryos and the stem cells
of teratocarcinomas have been demonstrated experimentally to be capable of
substituting for one another in their respective roles. Thus, an embryonic stem cell
introduced to a syngeneic host may give rise to a teratocarcinoma containing all of
the elements that would be found in a spontaneous tumour of this type (Mintz et al
1978). Likewise, embryonal carcinoma cells derived from a spontaneous germ cell
carcinoma may participate in embryonic development, and generate normal somatic
tissue following injection into a blastocyst (Brinster 1974; Mintz and Illmensee 1975;
Papaioannou et al 1975). This clearly demonstrates that murine EC cells may respond
to developmental cues in an appropriate manner, and that their differentiation may
provide information pertinent to normal embryogenesis. Similarly, human EC cells
may provide an insight into the processes that regulate human development.

The TERA2 cell line was derived from a lung metastasis of a human teratocarcinoma
in the mid 1970s (Fogh and Trempe, 1975). Morphologically, TERA2 cultures are
quite divergent from the characteristic EC phenotype and display significant
heterogeneity, suggesting that these cells undergo spontaneous differentiation
(Andrews et al., 1980). However, a tumour containing both embryonal carcinoma
cells and differentiated derivatives was produced following injection of TERA2 into
a nude mouse host (Andrews et al., 1983a; Andrews et al., 1983b; Andrews et al.,
1984). A cell line established from the EC component of this tumour, named
NTERA2, closely resembled and maintained the characteristic EC phenotype in
culture and, unlike the parent line, was able to produce teratocarcinoma in nude mice
with high frequency (Andrews et al., 1983a; Andrews et al., 1983b; Andrews et al.,
1984). Additionally, various subclones of NTERA2 exhibit the ability to
differentiate extensively in vitro following treatment with chemical inducers (eg
retinoic acid (RA), HMBA) (Andrews, 1984; Andrews et al., 1986).

The expression of human Notch homologues was examined in NTERA2 to
determine their involvement in ES cell differentiation.

We have discovered that members of the Notch gene family, Notch1 (Genbank
accession number AF308602), Notch2 (Genbank accession number NM_024408)
and Notch3 (Genbank accession number NM_000435) are expressed in EC cells and
NTERA2 cells. Notch1 expression was detected as a mRNA band of around 7Kb in
both EC and differentiated cultures of NTERA2. Notch3, like Notch1, was
examined in EC cells. A transcript of around 8Kb was readily detected in all samples.
The endoderm-specific Notch4 (Genbank accession number XM_004207) was not
detected.

All three Notch homologues expressed by NTERA2 showed altered transcription
during differentiation in response to retinoic acid. In each case, however, these
changes were modest and expression was evident in both EC and differentiated
cultures. The role of the Notch pathway in directing EC/ES differentiation may thus
depend to a greater extent on the level of signalling activation than on the
abundance of the receptors. In order to investigate this possibility, the expression of
candidate ligands for Notch receptors was examined. For example, dlk (Genbank
accession number U15979) was detected at high levels in EC cultures, but its
expression was almost extinguished by 3 days following RA treatment. Low levels
were also observed through 7 and 14 days post-RA. However, by 21 days, dlk was
up-regulated to the level seen in EC cultures. These profound changes may reflect an
important role for dlk and other DSL ligands in regulating EC/ES differentiation
through altered Notch signalling activation. These data suggest that the Notch
signalling pathway is involved in regulating EC cell differentiation and, by
extrapolation, human ES cell differentiation.

A degenerate PCR strategy was used to investigate the possible expression of novel
Wnt genes in the NTERA2 system. The expression of a single Wnt gene, Wnt-13,
was detected in NTERA2. Wnt-13 was absent in EC cells, but showed induction and
subsequent up-regulation following both retinoic acid and HMBA treatment. Both of
these agents bring about extensive differentiation of NTERA2, accompanied by the
loss of typical human EC surface markers.

We have examined the expression of components of the Wnt pathway and of
transcripts corresponding to other proteins known to interact with Wnt signalling in
NTERA2 cells. These cells are a model system for aspects of human embryogenesis
and differentiate extensively in vitro in response to chemical inducers. Among the
cell types produced following retinoic acid treatment are functional, post-mitotic,
CNS neurons (1, 6, 17).

The modulation of the Notch and Wnt signalling pathways may facilitate
manipulation of embryonic stem cell differentiation. The term modulation refers to
either the maintenance of embryonic stem cells as embryonic stem cells or the
facilitation of differentiation of embryonic stem cells along defined cell lineages.

According to an aspect of the invention there is provided a method to modulate the
phenotype of an embryonic stem cell comprising contacting said cell with a ligand
binding domain of a polypeptide wherein said domain binds its cognate receptor
expressed by said cell to modulate said phenotype.

According to a further aspect of the invention there is provided a method to modulate
the differentiation of an embryonic stem cell comprising:

i) providing a culture of embryonic stem cells;

ii) providing at least one ligand, or the active binding fragment thereof, capable
of binding its cognate receptor polypeptide expressed by said embryonic stem
cell;

iii) forming a culture comprising embryonic stem cells and said ligand; and

iv) growing said cell culture.

In a preferred method of the invention said ligand is encoded by a nucleic acid
molecule selected from the group consisting of:

i) a nucleic acid molecule as represented in Figure 22;

ii) a nucleic acid molecule which hybridises to the nucleic acid in (i) and
which encodes a ligand capable of binding a Wnt receptor; and

iii) nucleic acid molecules which are degenerate as a result of the genetic
code to the sequences defined in (i) and (ii) above.
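
By way of illustration only, the notion of nucleic acid molecules that are
"degenerate as a result of the genetic code" can be sketched as below. The sketch
assumes the standard genetic code; the function names translate and is_degenerate_to
and the short test sequences are hypothetical and are not taken from the figures.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard (nuclear) code applies; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence (length a multiple of 3) codon by codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def is_degenerate_to(dna_a: str, dna_b: str) -> bool:
    """True if two different nucleotide sequences encode the same polypeptide."""
    return dna_a.upper() != dna_b.upper() and translate(dna_a) == translate(dna_b)
```

Here GCT and GCC both encode alanine, so two sequences differing only at that
position would encode the same polypeptide while differing in nucleotide sequence.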

In a preferred method of the invention said ligand is selected from the group
consisting of: WNT 1; WNT 2; WNT 3; WNT 4; WNT 5A; WNT 6; WNT 7A;
WNT 8B; WNT 10B; WNT 11; WNT 14; WNT 16.

In a further preferred method of the invention said ligand is WNT 13.



In an alternative preferred method of the invention said ligand is encoded by a
nucleic acid molecule selected from the group consisting of:

i) a nucleic acid molecule as represented in Figures 2, 4, 5, 7, 10, 12, 14, 16, or
18;

ii) a nucleic acid molecule which hybridises to the nucleic acid in (i) and which
encodes a ligand capable of modulating embryonic stem cell differentiation;
and

iii) nucleic acid molecules which are degenerate as a result of the genetic code to
the sequences defined in (i) and (ii) above.

In a further preferred method of the invention said ligand is selected from the group 
represented by the amino acid sequences in Figures 3, 6, 8, 9, 11, 13, 15, 17, 19, or 
polypeptide variants thereof. 

Polypeptide variants are polypeptide sequences having at least 75% identity with the
polypeptide sequences as herein disclosed, or fragments and functionally equivalent
polypeptides thereof. In one embodiment, the polypeptides have at least 85% identity,
more preferably at least 90% identity, even more preferably at least 95% identity, still
more preferably at least 97% identity, and most preferably at least 99% identity with
the amino acid sequences illustrated herein.
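
As a minimal sketch of how such an identity threshold might be checked, the
following computes percent identity over a pairwise alignment that has already been
produced by some alignment method. The specification does not state how identity is
to be calculated; the choice of denominator (all alignment columns except those in
which both sequences carry a gap), the gap character and the function names are
assumptions made for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the columns of a pre-computed pairwise alignment.

    Assumption: both strings come from the same alignment, so they have equal
    length, and gaps are written with the `gap` character.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns in which both sequences carry a gap.
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("alignment has no residues to compare")
    # A column counts as a match only if both residues are identical non-gaps.
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 75.0) -> bool:
    """True if the aligned pair reaches the stated percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

The default threshold of 75.0 mirrors the lowest figure recited above; the stricter
embodiments would be tested by passing 85.0, 90.0, 95.0, 97.0 or 99.0.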

In a further preferred method of the invention said cells are induced to differentiate
by the addition of at least one agent selected from the group consisting of: retinoic
acid; HMBA; bone morphogenetic proteins; bromodeoxyuridine; lithium; sonic
hedgehog.

Optionally the inducing agent and the ligand are added simultaneously to a culture of
embryonic stem cells. Alternatively, the ligand is added before addition of said
inducing agent.



According to a further aspect of the invention there is provided a method for
modulating the differentiation of embryonic stem cells comprising:

i) providing a cell transfected with a nucleic acid molecule selected from the
group consisting of:

a) a nucleic acid molecule as represented in Figures 2, 4, 5, 7, 10, 12, 14, 16, 18;
b) a nucleic acid molecule which hybridises to the nucleic acid in (a) and which
encodes a ligand capable of modulating embryonic stem cell differentiation; and
c) nucleic acid molecules which are degenerate as a result of the genetic code to
the sequences defined in (a) and (b) above;

ii) forming a culture comprising the cell identified in (i) above with an
embryonic stem cell; and

iii) growing said culture under conditions suitable for the maintenance and/or
differentiation of said embryonic stem cell.



According to a yet further aspect of the invention there is provided a method for
modulating the differentiation of embryonic stem cells comprising:

i) providing a cell transfected with a nucleic acid molecule selected from the
group comprising:

a) a nucleic acid molecule as represented by the sequence in Figure 22;

b) a nucleic acid molecule which hybridises to the nucleic acid in (a) and
which encodes a ligand capable of binding a Wnt receptor; and

c) nucleic acid molecules which are degenerate as a result of the genetic
code to the sequences defined in (a) and (b) above;

ii) forming a culture comprising a cell as identified in (i) above with an
embryonic stem cell; and

iii) growing said culture under conditions suitable for the maintenance and/or
differentiation of embryonic stem cells.

In a preferred method of the invention said cell expresses Wnt-13.

Optionally the cells expressing the ligand(s) are mixed with a culture of
undifferentiated embryonic stem cells. This is followed by addition of the inducing
agent (eg retinoic acid; HMBA; bone morphogenetic proteins; bromodeoxyuridine;
lithium; sonic hedgehog).

In a preferred method of the invention said nucleic acid molecule hybridises under
stringent hybridisation conditions to the nucleic acid molecules represented in (a), (b)
or (c) above.

Stringent hybridisation or washing conditions are well known in the art. For example,
stringent conditions are those under which nucleic acid hybrids remain stable after
washing in 0.1xSSC, 0.1% SDS at 60°C. It is well known in the art that optimal
hybridisation conditions can be calculated if the sequence of the nucleic acid is
known. For example, hybridisation conditions can be determined by the GC content of
the nucleic acid subject to hybridisation. Please see Sambrook et al (1989) Molecular
Cloning: A Laboratory Manual. A common formula for calculating the stringency
conditions required to achieve hybridisation between nucleic acid molecules of a
specified homology is:

Tm = 81.5°C + 16.6 log[Na+] + 0.41 (%G+C) - 0.63 (%formamide)
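
The formula above can be evaluated as in the following sketch, offered for
illustration only. It assumes that the logarithm is to base 10, that [Na+] is the
molar sodium ion concentration, and that G+C content and formamide are expressed as
percentages (0 to 100); the function and parameter names are hypothetical.

```python
import math


def melting_temperature(na_molar: float, gc_percent: float,
                        formamide_percent: float = 0.0) -> float:
    """Estimate Tm in degrees C from the formula given in the text.

    Tm = 81.5 + 16.6 log10[Na+] + 0.41 (%G+C) - 0.63 (%formamide)

    Assumptions: base-10 logarithm, molar Na+ concentration, percentages
    supplied on a 0-100 scale.
    """
    if na_molar <= 0:
        raise ValueError("sodium concentration must be positive")
    if not 0.0 <= gc_percent <= 100.0:
        raise ValueError("G+C content must lie between 0 and 100 percent")
    if not 0.0 <= formamide_percent <= 100.0:
        raise ValueError("formamide must lie between 0 and 100 percent")
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.63 * formamide_percent)
```

At 1 M sodium the logarithmic term is zero, so the estimate depends only on G+C
content and formamide; lowering the salt concentration or adding formamide lowers
the estimated Tm and so raises the stringency at a given wash temperature.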



In a further preferred method of the invention the nucleic acid molecule is genomic
DNA or cDNA.

In a preferred method of the invention the nucleic acid molecule encodes a ligand of 
human origin. 

In a further preferred method of the invention said embryonic stem cells are of human
origin.

In a yet further preferred method of the invention the cell transfected with the nucleic
acid according to the invention is a mammalian cell. Preferably the cell is selected
from the following group: a Chinese hamster ovary cell; murine primary fibroblast
cell; human primary fibroblast cell; transformed mouse fibroblast cell-line STO.

According to a further aspect of the invention there is provided a method for
inhibiting the differentiation of embryonic stem cells or embryonal carcinoma cells
comprising:

i) providing at least one polypeptide, or active fragment thereof, wherein said
polypeptide is an inhibitor of the Wnt signalling pathway;

ii) forming a culture comprising the polypeptide identified in (i) above with an
embryonic stem cell; and

iii) growing said culture under conditions suitable for the maintenance of
embryonic stem cells in an undifferentiated state.

In a preferred method of the invention said inhibitor of Wnt signalling is selected
from the group comprising the following polypeptides, or the active binding
fragments thereof: frizzled related polypeptides (FRP); Wnt Inhibitory Factors (WIF);
Dickkopf; Cerberus.

In a further preferred method of the invention said inhibitor of Wnt signalling is
selected from the group comprising the following polypeptides, or the active binding
fragments thereof: SFRP1; SFRP4; FRZB; SFRP2; FZD1; FZD2; FZD9; FZD3;
FZD5; FZD4; FZD6; FZD7; DVL2; DVL3; GSK3B; AXIN1; APC; TCF1; WIF-1;
CER-1; DKK1-4; SARP 2; SARP 3.

According to a further aspect of the invention there is provided a method for
inhibiting the differentiation of embryonic stem cells or embryonal carcinoma cells
comprising:

i) providing a cell transfected with a nucleic acid molecule selected from the
group consisting of:

a) a nucleic acid molecule encoding a Wnt inhibitory polypeptide;
b) a nucleic acid molecule which hybridises to the nucleic acid in (a) and
which encodes a polypeptide capable of inhibiting Wnt signalling; and
c) nucleic acid molecules which are degenerate as a result of the genetic
code to the sequences defined in (a) and (b) above;

ii) contacting the cell of (i) above with a culture of embryonic stem cells; and

iii) growing said culture under conditions suitable for the maintenance of
embryonic stem cells in an undifferentiated state.

In a preferred method of the invention said cells express at least one Wnt inhibitory
polypeptide selected from the group comprising the following polypeptides, or the
active binding fragments thereof: frizzled related polypeptides (FRP); Wnt Inhibitory
Factors (WIF); Dickkopf; Cerberus. Preferably said cells express at least one Wnt
inhibitory polypeptide selected from the group comprising the following polypeptides,
or the active binding fragments thereof: SFRP1; SFRP4; FRZB; SFRP2;
FZD1; FZD2; FZD9; FZD3; FZD5; FZD4; FZD6; FZD7; DVL2; DVL3; GSK3B;
AXIN1; APC; TCF1; WIF-1; CER-1; DKK1-4.

In a further preferred method of the invention the nucleic acid molecule is one
which hybridises under stringent hybridisation conditions to the nucleic acid
molecules represented in (a), (b) or (c) above. Preferably said inhibitors are human.

According to a further aspect of the invention there is provided a vector comprising
the nucleic acid molecule according to the invention. Preferably the vector is an
expression vector adapted for the expression of the polypeptide encoded by said
nucleic acid molecule.

Typically said adaptation includes, by example and not by way of limitation, the
provision of transcription control sequences (promoter sequences) which mediate
cell/tissue specific expression. These promoter sequences may be cell/tissue specific,
inducible or constitutive.

Promoter is an art recognised term and, for the sake of clarity, includes the following
features which are provided by example only, and not by way of limitation. Enhancer
elements are cis acting nucleic acid sequences often found 5' to the transcription
initiation site of a gene (enhancers can also be found 3' to a gene sequence or even
located in intronic sequences and are therefore position independent). Enhancers
function to increase the rate of transcription of the gene to which the enhancer is
linked. Enhancer activity is responsive to trans acting transcription factors
(polypeptides) which have been shown to bind specifically to enhancer elements. The
binding/activity of transcription factors (please see Eukaryotic Transcription Factors,
by David S Latchman, Academic Press Ltd, San Diego) is responsive to a number of
environmental cues which include, by example and not by way of limitation,
intermediary metabolites (eg glucose, lipids) and environmental effectors (eg light,
heat).

Promoter elements also include so called TATA box and RNA polymerase initiation
selection (RIS) sequences which function to select a site of transcription initiation.
These sequences also bind polypeptides which function, inter alia, to facilitate
transcription initiation selection by RNA polymerase.

Adaptations also include the provision of selectable markers and autonomous
replication sequences which both facilitate the maintenance of said vector in either
the eukaryotic cell or prokaryotic host. Vectors which are maintained autonomously
are referred to as episomal vectors. Episomal vectors are desirable since these
molecules can incorporate large DNA fragments (30-50kb DNA). Episomal vectors of
this type are described in WO98/07876. Alternatively, the vector is an integrating
vector.

Adaptations which facilitate the expression of vector encoded genes include the
provision of transcription termination/polyadenylation sequences. This also includes
the provision of internal ribosome entry sites (IRES) which function to maximise
expression of vector encoded genes arranged in bicistronic or multi-cistronic
expression cassettes.

These adaptations are well known in the art. There is a significant amount of
published literature with respect to expression vector construction and recombinant
DNA techniques in general. Please see Sambrook et al (1989) Molecular Cloning: A
Laboratory Manual, Cold Spring Harbour Laboratory, Cold Spring Harbour, NY and
references therein; Marston, F (1987) DNA Cloning Techniques: A Practical
Approach Vol III, IRL Press, Oxford UK; F M Ausubel et al, Current
Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994).

Conventional methods to introduce DNA or vector DNA into cells are well known in
the art and typically involve the use of chemical reagents, cationic lipids or physical
methods. Chemical methods which facilitate the uptake of DNA by cells include the
use of DEAE-dextran (Vaheri and Pagano Science 175: p434). DEAE-dextran is a
positively charged polymer (a polycation) which associates with DNA and introduces
it into cells but which can result in loss of cell viability. Calcium phosphate is also a
commonly used chemical agent which when co-precipitated with DNA introduces the
DNA into cells (Graham et al Virology (1973) 52: p456).

The use of cationic lipids (eg liposomes, Felgner (1987) Proc. Natl. Acad. Sci. USA,
84: p7413) has become a common method since it does not have the degree of
toxicity shown by the above described chemical methods. The cationic head of the
lipid associates with the negatively charged nucleic acid backbone of the DNA to be
introduced. The lipid/DNA complex associates with the cell membrane and fuses
with the cell to introduce the associated DNA into the cell. Liposome mediated DNA
transfer has several advantages over existing methods. For example, cells which are
recalcitrant to traditional chemical methods are more easily transfected using
liposome mediated transfer.

More recently still, physical methods to introduce DNA have become effective means
to reproducibly transfect cells. Direct microinjection is one such method which can
deliver DNA directly to the nucleus of a cell (Capecchi (1980) Cell, 22: p479). This
allows the analysis of single cell transfectants. So called "biolistic" methods
physically shoot DNA into cells and/or organelles using a particle gun (Neumann
(1982) EMBO J, 1: p841). Electroporation is arguably the most popular method to
transfect DNA. The method involves the use of a high voltage electrical charge to
momentarily permeabilise cell membranes, making them permeable to
macromolecular complexes. However, physical methods to introduce DNA do result
in considerable loss of cell viability due to intracellular damage. These methods
therefore require extensive optimisation and also require expensive equipment.

More recently still a method termed immunoporation has become a recognised
technique for the introduction of nucleic acid into cells, see Bildirici et al, Nature
405, 769. The technique involves the use of beads coated with an antibody to a
specific receptor. The transfection mixture includes nucleic acid, typically vector
DNA, antibody coated beads and cells expressing a specific cell surface receptor. The
coated beads bind the cell surface receptor and when a shear force is applied to the
cells the beads are stripped from the cell surface. During bead removal a transient
hole is created through which nucleic acid and/or other biological molecules, eg
polypeptides, can enter. Transfection efficiency of between 40-50% is achievable
depending on the nucleic acid used.



Other non-liposome based, chemical transfectant agents have become available, for
example ExGen500 (polyethylenimine), produced by MBI Fermentas. ExGen500 is
particularly effective for transfection of human ES cells (Eiges, 2001).

According to a further aspect of the invention there is provided a method for the
production of the polypeptide encoded by the nucleic acid molecule according to the
invention comprising:

i) providing a cell transformed/transfected with a nucleic acid molecule
according to the invention;

ii) growing said cell in conditions conducive to the manufacture of said
polypeptide; and

iii) purifying said polypeptide from said cell, or its growth environment.

In a preferred method of the invention said nucleic acid molecule is the vector
according to the invention.

In a further preferred method of the invention said vector encodes, and thus said 
recombinant polypeptide is provided with, a secretion signal to facilitate purification 
of said polypeptide. 

According to a further aspect of the invention there are provided host cells which
have been transformed/transfected with the vector according to the invention, so as to
include at least part of the polypeptide according to the invention, so as to permit
expression of at least the functional part of the polypeptide encoded by said nucleic
acid molecule.

Ideally said host cells are eukaryotic cells, for example, insect cells such as cells from 
a species Spodoptera frugiperda using the baculovirus expression system. 

According to a further aspect of the invention there is provided a therapeutic cell
composition comprising differentiated or differentiating embryonic stem cells
derived by the method according to the invention. Preferably said composition is for
use in the treatment of: Parkinson's disease; Huntington's disease; motor neurone
disease; heart disease; diabetes; liver disease (eg cirrhosis); renal disease; AIDS.

According to a further aspect of the invention there is provided a method of treatment
of an animal comprising administering a cell composition comprising embryonic
stem cells which have been induced to differentiate into at least one cell-type.

According to a yet further aspect of the invention there is provided conditioned medium
obtained by culturing embryonic stem cells according to any of the methods
herein disclosed.

An embodiment of the invention will now be described by example only and with
reference to the following figures:

Figure 1 is a schematic representation of conserved domains in Notch polypeptides;
Figure 2 is the nucleic acid sequence of murine notch ligand delta-like 1;
Figure 3 is the amino acid sequence of murine notch ligand delta-like 1;
Figure 4 is the nucleic acid sequence of murine notch ligand jagged 1;
Figure 5 is the nucleic acid sequence of human notch ligand jagged 1 (alagille
syndrome) (JAG1);
Figure 6 is the amino acid sequence of human notch ligand jagged 1 (alagille
syndrome);
Figure 7 is the nucleic acid sequence of human notch ligand jagged 2 (JAG2);
Figure 8 is the amino acid sequence of human notch ligand jagged 2 (JAG2);
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Figure 9 is the amino acid sequence of murine notch ligand jagged 1; 



Figure 10 is the nucleic acid sequence of murine notch ligand jagged 2;
Figure 11 is the amino acid sequence of murine notch ligand jagged 2;
Figure 12 is the nucleic acid sequence of human notch ligand delta-like 3 (DLL3);
Figure 13 is the amino acid sequence of human notch ligand delta-like 3 precursor
polypeptide;
Figure 14 is the nucleic acid sequence of human notch ligand delta-1 (DLL1);
Figure 15 is the amino acid sequence of murine notch ligand delta-like 1;
Figure 16 is the nucleic acid sequence of human notch ligand delta-like 4 (DLL4);
Figure 17 is the amino acid sequence of human notch ligand delta-like 4 (DLL4);
Figure 18 is the nucleic acid sequence of murine notch ligand delta-like 4 (DLL4);
Figure 19 is the amino acid sequence of murine notch ligand delta-like 4 (DLL4);
Figure 20 is a western blot of cell extracts of various EC cell-lines probed with
Notch 2 antisera;
Figure 21 represents northern blot analysis of the expression patterns of notch genes
(Notch 1, 2, 3) and notch ligands (Dlk, jagged 1) in EC cells and EC cells treated with
retinoic acid (RA);
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Figure 22 represents the nucleic acid sequence of human Wnt 13; 

Figure 23 is a diagrammatic representation of the Wnt signalling pathway;
Figure 24 represents northern blot analysis of Wnt 13 and mRNAs corresponding to
Frizzled receptors and Frizzled related protein antagonists of Wnt signalling in
NTERA 2 cells various Wnt inhibitors after exposure of NTERA 2 cells;
Figure 25 represents a northern blot analysis of intracellular components of Wnt
signalling pathway in NTERA 2 cells;
Figure 26 represents the nucleic acid sequence of human dickkopf1;
Figure 27 represents the nucleic acid sequence of human dickkopf1;
Figure 28 represents the nucleic acid sequence of human dickkopf3; and
Figure 29 represents the nucleic acid sequence of human dickkopf4;
Figure 30 represents the nucleic acid sequence of WNT-1;
Figure 31 represents the amino acid sequence of WNT-1;
Figure 32 represents the nucleic acid sequence of WNT-2;
Figure 33 represents the amino acid sequence of WNT-2;
Figure 34 represents the nucleic acid sequence of WNT 2B;
Figure 35 represents the amino acid sequence of WNT 2B;
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Figure 36 represents the nucleic acid sequence of WNT 3; 
Figure 37 represents the amino acid sequence of WNT 3;
Figure 38 represents the nucleic acid sequence of WNT 4;
Figure 39 represents the amino acid sequence of WNT 4;
Figure 40 represents the nucleic acid sequence of WNT 5A;
Figure 41 represents the amino acid sequence of WNT 5A;
Figure 42 represents the nucleic acid sequence of WNT 6;
Figure 43 represents the amino acid sequence of WNT 6;
Figure 44 represents the nucleic acid sequence of WNT 7A;
Figure 45 represents the amino acid sequence of WNT 7A;
Figure 46 represents the amino acid sequence of WNT 7B;
Figure 47 represents the nucleic acid sequence of WNT 8B;
Figure 48 represents the amino acid sequence of WNT 8B;
Figure 49 represents the nucleic acid sequence of WNT 10B;
Figure 50 represents the amino acid sequence of WNT 10B;
Figure 51 represents the nucleic acid sequence of WNT 11;
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Figure 52 represents the amino acid sequence of WNT 11; 
Figure 53 represents the nucleic acid sequence of WNT 14;
Figure 54 represents the amino acid sequence of WNT 14;
Figure 55 represents the nucleic acid sequence of WNT 16;
Figure 56 represents the amino acid sequence of WNT 16;
Figure 57 represents the nucleic acid sequence of FZD 1;
Figure 58 represents the amino acid sequence of FZD 1;
Figure 59 represents the nucleic acid sequence of FZD 2;
Figure 60 represents the amino acid sequence of FZD 2;
Figure 61 represents the nucleic acid sequence of FZD 3;
Figure 62 represents the amino acid sequence of FZD 3;
Figure 63 represents the nucleic acid sequence of FZD 4;
Figure 64 represents the amino acid sequence of FZD 4;
Figure 65 represents the nucleic acid sequence of FZD 5;
Figure 66 represents the amino acid sequence of FZD 5;
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Figure 67 represents the nucleic acid sequence of FZD 6; 
Figure 68 represents the amino acid sequence of FZD 6;
Figure 69 represents the nucleic acid sequence of FZD 7;
Figure 70 represents the amino acid sequence of FZD 7;
Figure 71 represents the nucleic acid sequence of FZD 8;
Figure 72 represents the amino acid sequence of FZD 8;
Figure 73 represents the nucleic acid sequence of FZD 9;
Figure 74 represents the amino acid sequence of FZD 9;
Figure 75 represents the nucleic acid sequence of FZD 10;
Figure 76 represents the amino acid sequence of FZD 10;
Figure 77 represents the nucleic acid sequence of FRP;
Figure 78 represents the amino acid sequence of FRP;
Figure 79 represents the nucleic acid sequence of SARP 1;
Figure 80 represents the amino acid sequence of SARP 1;
Figure 81 represents the nucleic acid sequence of SARP 2;
Figure 82 represents the amino acid sequence of SARP 2;
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Figure 83 represents the nucleic acid sequence of FRZB; 
Figure 84 represents the amino acid sequence of FRZB;
Figure 85 represents the nucleic acid sequence of FRPHE;
Figure 86 represents the amino acid sequence of FRPHE;
Figure 87 represents the nucleic acid sequence of SARP 3;
Figure 88 represents the amino acid sequence of SARP 3;
Figure 89 represents the nucleic acid sequence of CER 1;
Figure 90 represents the amino acid sequence of CER 1;
Figure 91 represents the nucleic acid sequence of DKK 1;
Figure 92 represents the amino acid sequence of DKK 1;
Figure 93 represents the nucleic acid sequence of DKK 2;
Figure 94 represents the amino acid sequence of DKK 2;
Figure 95 represents the nucleic acid sequence of DKK 3;
Figure 96 represents the amino acid sequence of DKK 3;
Figure 97 represents the nucleic acid sequence of DKK 4;
Figure 98 represents the amino acid sequence of DKK 4;
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Figure 99 represents the nucleic acid sequence of WIF-1; 
Figure 100 represents the amino acid sequence of WIF-1;
Figure 101 represents the nucleic acid sequence of SFRP 1;
Figure 102 represents the amino acid sequence of SFRP 1;
Figure 103 represents the nucleic acid sequence of SFRP 4;
Figure 104 represents the amino acid sequence of SFRP 4; and
Figure 105 represents a diagram depicting the pCMV-tracer vector.

Materials and Methods

Table 1 Cell lines derived from germ cell tumours.

Cell Line      Biopsy Site   Biopsy Histology   Xenograft Histology   Reference
2102Ep         Testis        EC, T, Y           EC                    (Andrews et al., 1980)
833KE          Testis        EC, T, C, S        EC                    (Andrews et al., 1980)
TERA-1         Lung          EC, T              -                     (Fogh and Trempe, 1975)
NTERA2 cl.D1   Lung          EC, T              EC, T                 (Fogh and Trempe, 1975) (Andrews, 1984)

Abbreviations used: EC, embryonal carcinoma; T, teratoma; S, seminoma; C,
choriocarcinoma; Y, yolk-sac carcinoma.
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Cell Lines derived from gestational choriocarcinomas. 



BEWO           Corresponds to gestational choriocarcinoma (Pattillo and Gay, 1968)



List of Antibodies Used

Antibody      Reference                  References
SSEA-3        Andrews et al., 1982       12
SSEA-4        Kannagi et al., 1983       18
Tra-1-60      Andrews et al., 1984       25
Tra-1-81      Andrews et al., 1984       25
Tra-2-54      Andrews et al., 1984       20
Tra-2-49      Andrews et al., 1984       20
A2B5          Fenderson et al., 1987     -
ME311         Fenderson et al., 1987     -
Vin-is-56     Andrews et al., 1990       44
Vin-is-53     Andrews et al., 1990       44
Vin-2PB-22    Andrews et al., 1990       44
Thy-1         Andrews et al., 1983       10



Expression Vectors 



The following mammalian expression vectors are used in the expression of ligands
herein disclosed:

Purchased from Stratagene Inc.: pExchange-1; pExchange-2; pExchange-3A, 3B, 3C;
pExchange-4A, 4B, 4C; pExchange-5A, 5B, 5C; pExchange-6A, 6B, 6C; pExchange
module EC-Hyg; pExchange module EC-Puro; pExchange module EC-Neo;
pCMV-Script; pCMV-Tag1; pCMV-Tag2; pCMV-Tag3; pCMV-Tag4; pCMV-Tag5;
pCMVLacI; pOPRSVI/MCS; pOPI3CAT; pERV3; pEGSH.

Purchased from Invitrogen Inc.
T-REx System vectors

pcDNA4/TO; pcDNA4/TO/myc-His; pcDNA6/TR; pT-REx-DEST30;
pT-REx-DEST31; pcDNA4/TO-E; pcDNA5/FRT/TO; pcDNA5/FRT/TO-TOPO.
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Geneswitch System vectors 

pGene/V5-His A, B, C; pSwitch 

Ecdysone-Inducible System

pVgRXR; pIND; pIND(SP1); pIND/V5-His; pIND/V5-His-TOPO; pIND/GFP;
pIND(SP1)/GFP.

pShooter vectors

pEF/myc/nuc; pCMV/myc/nuc; pEF/myc/mito; pCMV/myc/mito; pEF/myc/ER;
pCMV/myc/ER; pEF/myc/cyto; pCMV/myc/cyto.

Invitrogen Inc.

pTet-off; pTet-on; ptTA-2/3/4; pTet-tTS; pTRE2hyg;
pTRE2pur; pTRE2; pLP-TRE2; pTRE-Myc; pTRE-HA; pTRE-6xHN;
pTRE-d2EGFP; pBI; pBI-EGFP; pBI-G; pBI-L; pTK-Hyg.

"Living colours" vectors.

pDsRed2-N1; pDsRed2-C1; pECFP-N1; pEGFP-N1; pEGFP-N2; pEGFP-N3;
pEYFP-N1; pECFP-C1; pEGFP-C1; pEGFP-C2; pEGFP-C3;
pEYFP-C1; pd1EGFP-N1; pd1ECFP-N1; pd2EGFP-N1; pd2EYFP-N1;
pd4EGFP-N1; pCMS-EGFP; pHygEGFP; pEGFPLuc; pNF-KB-dsEGFP;
pIRES2-EGFP; pIRES-EYFP.

Maintenance of cell lines 

All cells were grown in Dulbecco's modified Eagle's medium (DMEM),
supplemented with 10% by volume foetal calf serum (Gibco BRL) and 2mM
L-glutamine. Tissue culture flasks were incubated in a humidified atmosphere of 10%
CO2 in air at 37°C.
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Treatment of NTERA2 Cells 
Retinoic acid 

Medium was aspirated from confluent flasks of EC cells and the cells rinsed in sterile
PBS. 1ml of 0.25% (w/v) trypsin in 2mM EDTA was added per 75cm² flask and the
flask incubated at room temperature for up to 5 minutes. Vigorous shaking was
subsequently used to dislodge the cells. Cells were suspended in 9ml of
supplemented DMEM per ml of trypsin used and counted in a haemocytometer. Cells
were seeded at 10⁶ cells per 75cm² flask, in medium containing 10⁻⁵ M
all-trans-retinoic acid (Eastman Kodak), diluted from a 10⁻² M stock solution in dimethyl
sulfoxide (DMSO). Flasks were incubated as described above and the media replaced
as and when required.
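
The dilution of the retinoic acid stock into culture medium can be checked with a short calculation. The sketch below applies C1 x V1 = C2 x V2 to the two concentrations given above; the 10ml medium volume per flask is a hypothetical value chosen for illustration, since the text does not state it.

```python
def stock_volume_ul(stock_molar: float, final_molar: float, medium_ml: float) -> float:
    """Return the stock volume (in µl) that gives final_molar in medium_ml of medium.

    Uses C1 * V1 = C2 * V2 and ignores the small volume added by the stock itself.
    """
    if not 0 < final_molar <= stock_molar:
        raise ValueError("final concentration must be positive and no higher than the stock")
    return final_molar * medium_ml * 1000.0 / stock_molar


RA_STOCK_M = 1e-2   # all-trans-retinoic acid stock in DMSO (from the text)
RA_FINAL_M = 1e-5   # working concentration in the medium (from the text)
MEDIUM_ML = 10.0    # hypothetical medium volume per flask, not given in the text

ra_ul = stock_volume_ul(RA_STOCK_M, RA_FINAL_M, MEDIUM_ML)
dmso_percent = ra_ul / (MEDIUM_ML * 1000.0) * 100.0  # DMSO carried over, % (v/v)

print(f"Add {ra_ul:.1f} µl of stock to {MEDIUM_ML:.0f} ml of medium")
print(f"DMSO carried over: about {dmso_percent:.2f}% (v/v)")
```

The same function would apply to any other agent supplied as a concentrated stock, provided the stock and working concentrations are expressed in the same unit.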

Hexamethylene bisacetamide (HMBA)

Cells to be treated with HMBA were prepared as described for retinoic acid, but
grown in medium supplemented with 10⁻³ M HMBA instead of RA.

Harvesting of cells 

Cells were dislodged from the culture vessel with trypsin and suspended in 9ml
culture medium per ml of trypsin solution used, as described above. The cell
suspension was then centrifuged at 400 x g for 3 minutes and the medium aspirated
from the resulting cell pellet. Cells were then rinsed in 5ml PBS and centrifuged
again at 400 x g for 1 minute. The PBS rinse was aspirated and the cells stored at
-80°C or used immediately.

Total RNA preparation 

Where possible, all vessels and all solutions used in RNA preparation and storage
were treated with a 0.01% (v/v) solution of diethylpyrocarbonate (DEPC) in distilled
water, and subsequently autoclaved.
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TRI reagent (Sigma) was added to pelleted cells in a quantity corresponding to 1ml 
per 75cm² flask. The lysate was agitated until homogeneous. 0.2ml of chloroform was
added per ml of TRI reagent used and the vessel vortexed for 10 seconds. After 10
minutes at room temperature, the lysate was centrifuged at 12000 x g for 15 minutes
at 4°C. Following centrifugation, the aqueous (uppermost) phase was transferred to a
fresh vessel and 0.5ml of isopropanol added per ml of TRI reagent used. The sample
was incubated at room temperature for 10 minutes, then centrifuged at 12000 x g for
10 minutes at 4°C. Following centrifugation, the supernatant was removed and the
pellet washed in 70% ethanol. RNA was dissolved in DEPC-treated, double-distilled
water.

Isolation of mRNA 

100mg oligo dT cellulose (Ambion) was suspended in 25ml binding buffer. Up to
2mg of total RNA was then added to the binding buffer and the suspension gently
agitated at room temperature for 45 minutes. The suspension was then centrifuged at
3000 x g for 10 minutes and the supernatant discarded. The resulting pellet was
re-suspended in a further 25ml of binding buffer and agitated at room temperature for
60 minutes. The suspension was again centrifuged at 3000 x g and the supernatant
discarded. The pellet of oligo dT cellulose was transferred to a spin column using a
minimal quantity of binding buffer to re-suspend. The column was spun at maximum
speed in a desktop microfuge for 30 seconds, and the eluate discarded. This was
repeated until the cellulose was dry. 200µl of wash buffer was then added to the
cellulose and mixed in with a pipette tip. The column was spun at maximum speed
for 1 minute and the eluate discarded. 200µl of DEPC-treated, double-distilled H2O
was then added to the cellulose and mixed in, as before. The column was then spun at
maximum speed for 2 minutes and the eluted mRNA collected.

Precipitation of RNA 

To the RNA solution was added 0.1x volume of 5M LiCl and 2.5x volume of 100%
ethanol. After vortexing briefly, the sample was incubated at -20°C for >60 minutes
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to precipitate. Precipitated RNA was centrifuged at maximum speed in a bench top 
microfuge for 30 minutes. The supernatant was discarded and the pellet rinsed in 
70% ethanol, then dissolved in H2O. 
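
The precipitation volumes scale with the volume of the RNA solution. The following sketch works them out for a given sample, assuming that both the 0.1x and the 2.5x multiples refer to the starting RNA solution volume; the function name and the 200µl example volume are illustrative only.

```python
def precipitation_volumes(sample_ul: float) -> dict:
    """Volumes (µl) of 5M LiCl and 100% ethanol for one RNA precipitation.

    Assumes both multiples (0.1x and 2.5x) are relative to the starting
    RNA solution volume.
    """
    licl_ul = 0.1 * sample_ul
    ethanol_ul = 2.5 * sample_ul
    total_ul = sample_ul + licl_ul + ethanol_ul
    return {
        "licl_ul": licl_ul,
        "ethanol_ul": ethanol_ul,
        "total_ul": total_ul,
        # Final LiCl molarity after mixing, from the 5M stock
        "final_licl_molar": 5.0 * licl_ul / total_ul,
    }


volumes = precipitation_volumes(200.0)  # hypothetical 200 µl RNA sample
for name, value in volumes.items():
    print(f"{name}: {value:.3f}")
```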
Quantitation of nucleic acid

A Beckman DU 650 spectrophotometer was used for the quantitation of both DNA
and RNA. The machine was set to measure absorbance at wavelengths of 260nm and
280nm. After blanking the machine on an appropriate solution, diluted DNA or RNA
samples in a volume of 100µl were added to the cuvette and measured. The
absorbance at 260nm was used to calculate nucleic acid concentration in µg/µl, as
shown below:

[Nucleic acid] = (A260 x N x DF) ÷ 1000

Where N is 33 for single-stranded DNA, 50 for double-stranded DNA and 40 for
RNA and DF is the dilution factor for the sample added to the cuvette.
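
As a worked illustration of the formula above, the sketch below computes the concentration in µg/µl from an A260 reading. The function and dictionary names are illustrative, and the example reading and dilution factor are made-up values, not data from this study.

```python
# Conversion factors N from the text (µg/ml per absorbance unit at 260nm)
CONVERSION_FACTORS = {"ssDNA": 33.0, "dsDNA": 50.0, "RNA": 40.0}


def nucleic_acid_ug_per_ul(a260: float, kind: str, dilution_factor: float) -> float:
    """Concentration in µg/µl: (A260 x N x DF) / 1000."""
    if kind not in CONVERSION_FACTORS:
        raise ValueError(f"unknown nucleic acid type: {kind!r}")
    return a260 * CONVERSION_FACTORS[kind] * dilution_factor / 1000.0


# Hypothetical reading: RNA diluted 1 in 100, A260 of 0.25
rna_conc = nucleic_acid_ug_per_ul(0.25, "RNA", 100.0)
print(f"RNA concentration: {rna_conc:.2f} µg/µl")
```

The division by 1000 converts µg/ml to µg/µl, which is why the result for this example is 1 µg/µl.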

Northern blot analysis 
Blot preparation 

1g of agarose was dissolved in 85ml H2O by boiling. After cooling to around 70°C,
10ml of 10x MOPS buffer and 5ml of formaldehyde were added, and the gel cast.
1-5µg of each mRNA sample was mixed with an appropriate quantity of 10x RNA
loading buffer to give a final volume of no more than 30µl. The RNA was then
denatured at 95°C for 2 minutes and quenched on ice for 10 minutes. The gel was
placed in an electrophoresis tank containing 1x MOPS buffer and the samples loaded
into each well of the gel, along with appropriate molecular weight markers in the
outermost wells. 80V were applied across the gel for 2-3 hours or as required.
Following electrophoresis, the outermost lanes containing the molecular weight
markers were removed using a scalpel and submerged in double-distilled H2O
containing ethidium bromide at 0.5µg/ml. The remainder of the gel was submerged in
>5 volumes of double-distilled H2O, which was replaced every 5 minutes for a total
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of 25 minutes. An appropriately sized piece of GeneScreen Plus (DuPont) membrane, 
just larger than the area of gel to be blotted, was cut. The membrane was hydrated by
briefly submerging in double-distilled H2O, then transferred to 10x SSC, concurrent
with the last 15 minutes of gel washing. The blotting apparatus was assembled as
shown in Figure 2.1, with the gel upside-down, using 10x SSC transfer buffer. After
transfer of at least 6 hours, the absorbent material was removed from the membrane.
After marking the position of the wells using a pencil, the membrane was removed
from the gel and washed briefly in 2x SSC. Whilst still damp, the RNA was fixed to
the membrane by UV crosslinking. The membrane was then baked at 80°C for 3
hours.

The excised marker lanes were de-stained by soaking in a large volume of
double-distilled H2O for around 3 hours, after which they were visualised on a UV
transilluminator and photographed.

Probe preparation

Random-primed DNA labelling was carried out using the Prime-a-Gene kit from
Promega. Approximately 25ng of template DNA (PCR or restriction digest product)
was denatured at 95°C for 2 minutes, then quenched on ice for 10 minutes. The
reaction mix was then assembled on ice, in the order indicated below:

10µl of 5x labelling buffer
H2O to give a final volume of 50µl
2µl unlabelled dNTP mix (0.5mM each)
25ng of denatured/quenched template DNA
2µl 10mg/ml BSA
5µl [α-32P]dATP 3000Ci/mmol (NEN DuPont)
1µl DNA polymerase I large (Klenow) fragment

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The labelling reaction mix was incubated at room temperature for 2 hours. After this 
period, unincorporated nucleotides were removed using Pharmacia S-300 MicroSpin
columns. Columns were placed in a microfuge tube and pre-spun at 735 x g for 1
minute. The column was then transferred to a fresh tube and the entire labelling
reaction added. The column was then spun at 735 x g for a further 2 minutes and the
purified, labelled DNA collected. Labelled DNA was denatured at 95°C for 2
minutes, then quenched on ice for 15 minutes.

Hybridisation and washing procedure 

Northern blots were equilibrated in 150ml of 2x SSC at 42°C for 15 minutes in a
hybridisation oven at 8 RPM. The SSC was exchanged for 25ml of hybridisation
buffer, pre-warmed to 42°C, and the filter incubated for a further 30 minutes at the
same temperature. The entire volume of purified probe solution was then added to
the hybridisation buffer and the blot incubated overnight at 42°C/8 RPM. The
hybridisation solution was then discarded and the blot washed as follows:

2x SSC at room temperature for 20 minutes
2x SSC at room temperature for 20 minutes
2x SSC/1% SDS at 65°C for 45 minutes
2x SSC/1% SDS at 65°C for 45 minutes
0.1x SSC at room temperature for 20 minutes
0.1x SSC at room temperature for 20 minutes

Filters were exposed to a Bio Rad BI phosphor-imager screen overnight and, in most
cases, subsequently exposed to X-ray film (Kodak X-omat AR).

Loading controls for Northern blots 

All Northern blots used in this study were probed with β-actin as a loading control.
Table 2.5 (overleaf) lists the figures to which each control probing (panel A to T,
Figure 2.2) corresponds. Northern blot data presented in this study have not, in all
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cases, been subject to repeat experiments using RNA isolated from different batches 
of cells. These data may not be regarded as conclusive, since reproducibility has not 
been proven. 

Method for Analysis of the Requirement for Notch Ligands in the
Differentiation of Embryonic Stem, Embryonal Carcinoma and their
Differentiated Derivatives.

CHO cells are transfected with constructs encoding either membrane bound or soluble
forms of the Notch ligands. These cell lines are used to support the growth of either
Embryonal carcinoma cells (EC), e.g. NTERA2/cl.D1, or Human embryonic stem cells
(hES).

The transfected CHO cells (CHO(DSL)) are used in the following way. To assess
membrane bound forms of the Notch ligands the CHO(DSL) cells are used as feeder
cells (i.e. the EC or hES cells are grown on top of the CHO(DSL) cells). To assess
the soluble forms of the Notch ligands either supernatant from the transfected CHO
cells or concentrated ligand molecules derived from the supernatant are added to the
culture medium of the EC and hES cells.

Notch Ligand Constructs.

The following cloned Notch ligands were obtained from Dr. Shigeru Chiba,
Department of Hematology, Oncology and Cell Therapy, Transplantation Medicine,
Graduate School of Medicine, University of Tokyo.

Delta1-FLAG
Jagged1-FLAG
Jagged2-FLAG

Soluble Delta1-Fc
Soluble Jagged1-Fc
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Soluble Jagged2-Fc 

These had been cloned into the vector pTRACER-CMV (from Invitrogen, Fig 30).

The clones used consisted either of the full length ligand linked to a histidine tag
(FLAG, Kodak Inc.), or a ligand lacking the membrane spanning and intracellular
portion of the protein thus rendering the ligand soluble. These had been linked to the
Fc portion of human IgG.

Generation of Notch Ligand expressing Cell lines

The Chinese Hamster Ovary derived cell line AA8 was maintained in MEM Alpha
medium with Glutamax-1 supplemented with ribonucleosides and
deoxyribonucleosides (Lifetechnologies) and 10% Foetal Bovine Serum
(FBS) (Lifetechnologies).

Plasmid was transfected into the AA8 cells using either Fugene (Roche) or Lipofectin
(Lifetechnologies) or Superfect (Qiagen) according to manufacturers' protocols.

Assessment of Transiently Transfected Cell lines for Ligand Production.

Production of both soluble and membrane bound forms of the Notch ligands is
assayed by western blotting and chemiluminescent detection.

Cells transfected with the ligand encoding constructs are harvested and the proteins
extracted. Due to the tagging of the ligands, proteins can be run out on an
SDS-PAGE gel, blotted and probed either with mouse anti-FLAG antibody, detected
using an anti-mouse HRP secondary antibody, or with an HRP-secondary antibody. Both
methods use electro-chemiluminescence (ECL) as the detection method.

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Concentration of Soluble Notch ligand from the Supernatant of Transfected 
CHO cells. 

Fc-labelled Notch ligand can be purified from transfected CHO cell supernatant
using a HiTrap protein G HP column (Amersham Pharmacia Biotech). A sample can
be analysed by western blotting as described above.

Embryonic Cell culture. 

Human Embryonal Carcinoma NTERA2/D1 cells are maintained in Dulbecco's
modified Eagle's medium (DMEM), supplemented with 2mM L-glutamine and 10%
Foetal Bovine Serum (Lifetechnologies), at 37°C under 10% CO2 in air. Cells
were passaged by scraping from the surface of the tissue culture flask with 3mm glass
beads and reseeded at 5 x 10⁶ cells per 75cm² flask. For specific seeding densities
cells were passaged using 0.25% Trypsin (Lifetechnologies) in Dulbecco's Phosphate
Buffered Saline (PBS) supplemented with 1mM EDTA.

Human Embryonic Stem Cells are maintained on irradiated mouse embryonic
fibroblasts in serum free conditions, with 80% F12:DMEM (Lifetechnologies), 20%
Knockout SR (Lifetechnologies), 1% Non-essential amino acid solution
(Lifetechnologies), 1mM L-glutamine, 0.1mM β-mercaptoethanol (Sigma) and 4 ng/ml
bFGF (Sigma). The cells are passaged using collagenase IV and scraping.

Flow Cytofluorimetry

Cells were removed from their adherent culture surface and incubated with suitable
primary antibody for 1 hour at 4°C. Cells are washed in PBS with 5% FCS and
incubated for a further hour with a suitable FITC-conjugated secondary
antibody, and analysed on an EPICS Elite ESP Flow Cytometer (Coulter Electronics).
Colonies were assessed for the presence of embryonal stem cell markers such as
SSEA-3, -4, Tra-1-60 and for appearance of differentiated marker
antigens such as A2B5, ME311 and N901.
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Design of oligonucleotide primers 

Primers for use in PCR were designed on a Macintosh Power PC, using the "Primer
Select" program of the DNASTAR software package (DNASTAR Inc.). All primers
used in this study are shown in Table 2.



Table 2 List of oligonucleotide primers

Gene      GenBank accession   Primer direction   Primer location   Primer sequence 5' to 3'
Wnt-13    Z71621              Forward            1159-1178         Tgagtggttcctgtactctg
                              Reverse            1503-1484         Actcacactgggtaacacgg
SFRP4     XM_004706           Forward            858-880           Agaggagtggctgcaatgaggtc
                              Reverse            1159-1142         Gcgcccggctgttttctt
Waf1      U03106              Forward            487-506           Cagggtcgaaaacggcggca
                              Reverse            947-928           Aggagccacacccctccaga
β-actin   NM_001101           Forward            326-357           Atctggcaccacaccttctacaatgagctgcg
                              Reverse            1163-1132         Cgtcatactcctgcttgctgatccacatctgc
neuroD1   NM_002500           Forward            240-263           Aagccatgaacgcagaggaggact
                              Reverse            818-799           Agctgtccatggtaccgtaa
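
The expected PCR product sizes follow from the primer locations in Table 2. The sketch below computes them, assuming the locations are 1-based, inclusive coordinates on the listed accession and that the first coordinate of each reverse primer is its 5' end; the dictionary and function names are illustrative.

```python
# (5' end of forward primer, 5' end of reverse primer), taken from Table 2
PRIMER_PAIRS = {
    "Wnt-13": (1159, 1503),
    "SFRP4": (858, 1159),
    "Waf1": (487, 947),
    "beta-actin": (326, 1163),
    "neuroD1": (240, 818),
}


def amplicon_length(forward_5prime: int, reverse_5prime: int) -> int:
    """Length in bp of the product spanning both primers (inclusive coordinates)."""
    if reverse_5prime <= forward_5prime:
        raise ValueError("reverse primer must lie downstream of the forward primer")
    return reverse_5prime - forward_5prime + 1


expected_sizes = {gene: amplicon_length(*ends) for gene, ends in PRIMER_PAIRS.items()}
for gene, size in expected_sizes.items():
    print(f"{gene}: {size} bp")
```

These sizes are only as reliable as the coordinate assumption; they are useful mainly as a check when reading bands against molecular weight markers.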



All PCR data presented in this study were duplicated in independent experiments to
eliminate the possibility of methodological error. However, duplicate experiments
were performed on identical samples and do not, therefore, control for variability
between separate batches of cells. Polymerase chain reactions from which
quantitative interpretations were to be made were controlled by parallel amplification
of the cyclin-dependent kinase inhibitor, Waf1. This transcript has been demonstrated
by other workers in the laboratory to be constitutively expressed by NTERA2 EC
cells and differentiated derivatives (unpublished data). Furthermore, Waf1 has been
shown to exhibit an approximately 20-fold lower abundance in the NTERA2 system
than the more widely used control, β-actin, and is therefore well suited to the analysis
of rare transcripts.

PCR Reaction conditions

PCR mixes were assembled on ice, with the following components per reaction:

5µl of 25mM MgCl2
5µl of 10x reaction buffer
5µl of 1mM dNTPs
3µl of forward primer at 5pmol/µl
3µl of reverse primer at 5pmol/µl
0.5µl of Taq polymerase at 1 unit/µl (Promega)
template and H2O to give 50µl final volume

A premix was made containing all reaction components bar the template. Premix was
then added to the reaction vessels containing the template, on ice. The reaction
vessels were then transferred to the thermal cycler. The PCR programs used are
shown in Table 3, with cycling from T1 → T2 → T3 → T1.
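
The final concentration of each component in the 50µl reaction, and the reagent volumes for a premix, can be derived from the list above. This is a minimal sketch: the 10% pipetting overage in `premix_volumes` is a hypothetical allowance, not something the text specifies, and water is left out because the template volume is not given.

```python
FINAL_VOLUME_UL = 50.0

# (component, volume in µl, stock concentration, stock unit), as listed in the text
PCR_COMPONENTS = [
    ("MgCl2", 5.0, 25.0, "mM"),
    ("reaction buffer", 5.0, 10.0, "x"),
    ("dNTPs", 5.0, 1.0, "mM"),
    ("forward primer", 3.0, 5.0, "pmol/µl"),
    ("reverse primer", 3.0, 5.0, "pmol/µl"),
    ("Taq polymerase", 0.5, 1.0, "unit/µl"),
]


def final_concentration(volume_ul: float, stock_conc: float,
                        final_volume_ul: float = FINAL_VOLUME_UL) -> float:
    """Concentration after dilution into the final reaction volume (same unit as stock)."""
    return volume_ul * stock_conc / final_volume_ul


def premix_volumes(n_reactions: int, overage: float = 0.1) -> dict:
    """Reagent volumes (µl) for n reactions plus a hypothetical pipetting overage."""
    scale = n_reactions * (1.0 + overage)
    return {name: volume * scale for name, volume, _, _ in PCR_COMPONENTS}


final = {name: final_concentration(vol, conc) for name, vol, conc, _ in PCR_COMPONENTS}
reagent_ul = sum(vol for _, vol, _, _ in PCR_COMPONENTS)
template_plus_water_ul = FINAL_VOLUME_UL - reagent_ul  # left for template and H2O

for name, _, _, unit in PCR_COMPONENTS:
    print(f"{name}: {final[name]:g} {unit}")
print(f"Template + H2O per reaction: {template_plus_water_ul:g} µl")
```

Since 1 pmol/µl equals 1µM, the primer figure printed by this sketch can be read directly as a micromolar concentration.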

Table 3 PCR thermal cycling programs

                       Program 1           Program 2          Program 3           Program 4
T1 (temp/duration)     96°C/30 seconds     94°C/60 seconds    94°C/90 seconds     95°C/90 seconds
T2 (temp/duration)     50°C/15 seconds     55°C/90 seconds    60°C/90 seconds     63°C/60 seconds
T3 (temp/duration)     60°C/240 seconds    72°C/60 seconds    72°C/120 seconds    72°C/60 seconds
Cycles                 25                  35                 35                  35
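
The programs in Table 3 can be written down as data and their hold times totalled, which may help when planning thermal cycler runs. The sketch below counts only the programmed T1, T2 and T3 holds; ramp times between temperatures depend on the instrument and are not included. The structure and names are illustrative.

```python
# Each step is (temperature in °C, duration in seconds), in the order T1, T2, T3
PROGRAMS = {
    "Program 1": {"steps": [(96, 30), (50, 15), (60, 240)], "cycles": 25},
    "Program 2": {"steps": [(94, 60), (55, 90), (72, 60)], "cycles": 35},
    "Program 3": {"steps": [(94, 90), (60, 90), (72, 120)], "cycles": 35},
    "Program 4": {"steps": [(95, 90), (63, 60), (72, 60)], "cycles": 35},
}


def cycling_seconds(program: dict) -> int:
    """Total programmed hold time in seconds, excluding temperature ramps."""
    per_cycle = sum(duration for _, duration in program["steps"])
    return per_cycle * program["cycles"]


hold_times = {name: cycling_seconds(program) for name, program in PROGRAMS.items()}
for name, seconds in hold_times.items():
    print(f"{name}: {seconds} s ({seconds / 60:.1f} min)")
```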
List of DNA and protein accession numbers of genes used in results

Gene Name   Description                               cDNA Accession Number    Protein Accession Number
WNT2B       wingless-type MMTV integration site       AB045116                 Q93097
            family, member 2B
SFRP1       secreted frizzled-related protein 1       AF056087                 AAC12877
SFRP4       secreted frizzled-related protein 4       AF026692                 AAC04617
FRZB        frizzled-related protein                  NM_001463                NP_001454
SFRP2       secreted frizzled-related protein 2       -                        -
FZD1        frizzled (Drosophila) homolog 1           AB017363                 BAA34666
FZD2        frizzled (Drosophila) homolog 2           NM_001466                NP_001457
FZD9        frizzled (Drosophila) homolog 9           HSU82169                 AAC51174
FZD3        frizzled (Drosophila) homolog 3           Kirikoshi et al., 2000   Kirikoshi et al., 2000
FZD5        frizzled (Drosophila) homolog 5           -                        -
FZD4        frizzled (Drosophila) homolog 4           NM_012193                NP_036325
FZD6        frizzled (Drosophila) homolog 6           AB012911                 BAA25686
FZD7        frizzled (Drosophila) homolog 7           AB017365                 BAA34668
DVL2        dishevelled 2 (homologous to              NM_004422                NP_004413
            Drosophila dsh)
DVL3        dishevelled 3 (homologous to              NM_004423                NP_004414
            Drosophila dsh)
GSK3B       glycogen synthase kinase 3 beta           NM_002093                NP_002084
AXIN1       axin                                      AF009674                 AAC51624
APC         adenomatosis polyposis coli               NM_000038                NP_000029
TCF1        transcription factor 1, hepatic; LF-B1,   M57732                   AAA88077
            hepatic nuclear factor (HNF1), albumin
            proximal factor


Examples 

Expression of a single Wnt gene, Wnt-13 (2B), was detected. This transcript was
absent in NTERA2 EC cells, but showed marked up-regulation following RA
treatment (Figure 24). Members of the FRP family, encoding putative Wnt antagonists,
also showed altered expression during differentiation (Figure 24). Both Frp-1 and
SARP-1 were down-regulated following RA treatment, whilst FrpHE was absent in
EC cells, but expressed at high levels in RA treated cultures.

Several members of the frizzled family were also detected, providing a candidate
receptor system for Wnt-13 (Figure 24). Two of these, hFz-4 and hFz-6, showed
developmental regulation. Transcripts corresponding to intracellular components of
the Wnt pathway, including Dishevelled, GSK-3b, Axin, APC and TCF were present
at equivalent levels in EC and differentiating cultures. CBP was also ubiquitously
expressed.
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CLAIMS 



1. A method to modulate the differentiation of an embryonic stem cell
comprising:

i) providing a culture of embryonic stem cells;

ii) providing at least one ligand, or the active binding fragment thereof, capable
of binding its cognate receptor polypeptide expressed by said embryonic stem
cell;

iii) forming a culture comprising embryonic stem cells and said ligand; and

iv) growing said cell culture.



2. A method according to Claim 1 wherein said ligand is encoded by a nucleic
acid molecule selected from the group consisting of:

i) a nucleic acid molecule as represented in Figure 22;

ii) a nucleic acid molecule which hybridises to the nucleic acid in (i) and
which encodes a ligand capable of binding a Wnt receptor; and

iii) nucleic acid molecules which are degenerate as a result of the genetic
code to the sequences defined in (i) and (ii) above.



3. A method according to Claim 2 wherein said ligand is encoded by a nucleic 
acid molecule selected from the nucleic acid sequences represented in: Fig 30; Fig 
32; Fig 34; Fig 36; Fig 38; Fig 40; Fig 42; Fig 44; Fig 47; Fig 49; Fig 51; Fig 53; Fig 
55. 



4. A method according to Claim 2 or 3 wherein said ligand is encoded by a 
nucleic acid molecule as represented by the nucleic acid sequence in Fig 22. 

5. A method according to Claim 1 wherein said ligand is encoded by a nucleic 
acid molecule selected from the group consisting of: 

i) a nucleic acid molecule as represented in Figures 2, 4, 5, 7, 10, 12, 14, 16, or 
18; 

ii) a nucleic acid molecule which hybridises to the nucleic acid in (i) and which 
encodes a ligand capable of modulating embryonic stem cell differentiation; 
and 

iii) nucleic acid molecules which are degenerate as a result of the genetic code to 
the sequences defined in (i) and (ii) above. 






6. A method according to Claim 5 wherein said ligand is selected from the 
group comprising the amino acid sequences in Figures 3, 6, 8, 9, 11, 13, 15, 17, 19, 
or polypeptide variants thereof. 



7. A method according to any of Claims 1-6 wherein said cells are induced to 
differentiate by the addition of at least one agent selected from the group consisting 
of: retinoic acid; hexamethylene bisacetamide; bone morphogenetic proteins; 
bromodeoxyuridine; lithium; sonic hedgehog. 



8. A method for modulating the differentiation of embryonic stem cells 
comprising: 

i) providing a cell transfected with a nucleic acid molecule selected from the 
group consisting of: 

a) a nucleic acid molecule as represented in Figures 2, 4, 5, 7, 10, 12, 14, 16, or 18; 

b) a nucleic acid molecule which hybridises to the nucleic acid in (a) and which 
encodes a ligand capable of modulating embryonic stem cell differentiation; 
and 

c) nucleic acid molecules which are degenerate as a result of the genetic code to 
the sequences defined in (a) and (b) above; 

ii) forming a culture comprising the cell identified in (i) above with an 
embryonic stem cell; and 

iii) growing said culture under conditions suitable for the maintenance and/or 
differentiation of said embryonic stem cell. 



9. A method for modulating the differentiation of embryonic stem cells 
comprising: 

i) providing a cell transfected with a nucleic acid molecule selected from the 
group consisting of: 

a) a nucleic acid molecule as represented by the sequence in Figure 22; 

b) a nucleic acid molecule which hybridises to the nucleic acid in (a) and 
which encodes a ligand capable of binding a Wnt receptor; and 

c) nucleic acid molecules which are degenerate as a result of the genetic 
code to the sequences defined in (a) and (b) above; 

ii) forming a culture comprising a cell identified in (i) above with an embryonic 
stem cell; and 

iii) growing said culture under conditions suitable for the maintenance and/or 
differentiation of embryonic stem cells. 



10. A method according to Claim 9 wherein said cell expresses Wnt-13 ligand. 



11. A method according to any of Claims 9 or 10 wherein said cells are induced 
to differentiate by the addition of at least one agent selected from the group 
consisting of: retinoic acid; hexamethylene bisacetamide; bone morphogenetic 
proteins; bromodeoxyuridine; lithium; sonic hedgehog. 

12. A method according to any of Claims 1-11 wherein said nucleic acid 
molecule encodes a ligand of human origin. 

13. A method according to any of Claims 1-12 wherein said embryonic stem cells 
are of human origin. 

14. A method according to any of Claims 8-13 wherein said transfected cell is a 
mammalian cell. 

15. A cell according to Claim 14 wherein said cell is selected from the group 
consisting of: a Chinese hamster ovary cell; murine primary fibroblast cell; human 
primary fibroblast cell; transformed mouse fibroblast cell-line STO. 

16. A method for inhibiting the differentiation of embryonic stem cells 
comprising the steps of: 

i) providing at least one polypeptide, or active fragment thereof, wherein said 
polypeptide is an inhibitor of the Wnt signalling pathway; 

ii) forming a culture comprising the polypeptide identified in (i) above with an 
embryonic stem cell; and 

iii) growing said culture under conditions suitable for the maintenance of 
embryonic stem cells in an undifferentiated state. 


17. A method according to Claim 16 wherein said inhibitor is selected from the 
group consisting of the active binding fragments thereof of the following 
polypeptides: frizzled related polypeptides (FRP); Wnt Inhibitory Factors (WIF); 
Dickkopf; Cerberus. 


18. A method according to Claim 17 wherein said inhibitor is encoded by a 
nucleic acid molecule selected from the nucleic acid sequences represented by: Fig 
57; Fig 59; Fig 61; Fig 63; Fig 65; Fig 67; Fig 69; Fig 71; Fig 73; Fig 75; Fig 77; Fig 
79; Fig 81; Fig 83; Fig 85; Fig 87; Fig 89; Fig 91; Fig 93; Fig 95; Fig 97; Fig 99; Fig 
101; or Fig 103. 

19. A method for inhibiting the differentiation of embryonic stem cells 
comprising the steps of: 

i) providing a cell transfected with a nucleic acid molecule selected from the 
group consisting of: 

a) a nucleic acid molecule encoding a Wnt inhibitory polypeptide; 

b) a nucleic acid molecule which hybridises to the nucleic acid in (a) and 
which encodes a polypeptide capable of inhibiting Wnt signalling; and 

c) nucleic acid molecules which are degenerate as a result of the genetic 
code to the sequences defined in (a) and (b) above; 

ii) forming a culture of the cell identified in (i) above with an embryonic stem 
cell; and 

iii) growing said culture under conditions suitable for the maintenance of 
embryonic stem cells in an undifferentiated state. 

20. A method according to Claim 19 wherein said cells express at least one Wnt 
inhibitory polypeptide selected from the group consisting of the active binding 
fragments thereof of the following polypeptides: frizzled related polypeptides (FRP); 
Wnt Inhibitory Factors (WIF); Dickkopf; Cerberus. 

21. A method according to Claim 19 wherein said cells express at least one Wnt 
inhibitory polypeptide encoded by a nucleic acid molecule selected from the nucleic 
acid sequences represented by: Fig 57; Fig 59; Fig 61; Fig 63; Fig 65; Fig 67; Fig 69; 
Fig 71; Fig 73; Fig 75; Fig 77; Fig 79; Fig 81; Fig 83; Fig 85; Fig 87; Fig 89; Fig 91; 
Fig 93; Fig 95; Fig 97; Fig 99; Fig 101; or Fig 103. 


22. A cell or cell culture obtainable by the method according to any of Claims 1- 
21. 

23. A therapeutic cell composition obtainable by the method according to any of 
Claims 1-15. 

24. Use of a cell according to Claim 23 for the manufacture of a composition for 
use in the treatment of a disease selected from the group consisting of: 
Parkinson's disease; Huntington's disease; motor neurone disease; heart disease; 
diabetes; liver disease (eg cirrhosis); renal disease; AIDS. 






WO 02/077204 



PCT/GB02/01195 



25. A method of treatment of an animal, preferably a human, comprising 
administering a cell composition comprising embryonic stem cells which have been 
induced to differentiate into at least one cell-type by the method according to any of 
Claims 1-14. 

26. Conditioned medium obtained by culturing embryonic stem cells according to 
the method of any of Claims 1-21. 

Figure 2 

GTCCAGCGGTACCATGGGCCGTCGGAGCGCGCTAGCCCTTGCCGTGGTCTCTGCCCTGC 

TGTGCCAGGTCTGGAGCTCCGGCGTATTTGAGCTGAAGCTGCAGGAGTTCGTCAACAA 

GAAGGGGCTGCTGGGGAACCGCAACTGCTGCCGCGGGGGCTCTGGCCCGCCTTGCGCC 

TGCAGGACCTTCTTTCGCGTATGCCTCAAGCACTACCAGGCCAGCGTGTCACCGGAGCC 

ACCCTGCACCTACGGCAGTGCTGTCACGCCAGTGCTGGGTGTCGACTCCTTCAGCCTGC 

CTGATGGCGCAGGCATCGACCCCGCCTTCAGCAACCCCATCCGATTCCCCTTCGGCTTC 

ACCTGGCCAGGTACCTTCTCTCTGATCATTGAAGCCCTCCATACAGACTCTCCCGATGA 

CCTCGCAACAGAAAACCCAGAAAGACTCATCAGCCGCCTGACCACACAGAGGCACCTC 

ACTGTGGGAGAAGAATGGTCTCAGGACCTTCACAGTAGCGGCCGCACAGACCTCCGGT 

ACTCTTACCGGTTTGTGTGTGACGAGCACTACTACGGAGAAGGTTGCTCTGTGTTCTGC 

CGACCTCGGGATGACGCCTTTGGCCACTTCACCTGCGGGGACAGAGGGGAGAAGATGT 

GCGACCCTGGCTGGAAAGGCCAGTACTGCACTGACCCAATCTGTCTGCCAGGGTGTGA 

TGACCAACATGGATACTGTGACAAACCAGGGGAGTGCAAGTGCAGAGTTGGCTGGCAG 

GGCCGCTACTGCGATGAGTGCATCCGATACCCAGGTTGTCTCCATGGCACCTGCCAGC 

AACCCTGGCAGTGTAACTGCCAGGAAGGCTGGGGGGGCCTTTTCTGCAACCAAGACCT 

GAACTACTGTACTCACCATAAGCCGTGCAGGAATGGAGCCACCTGCACCAACACGGGC 

CAGGGGAGCTACACATGTTCCTGCCGACCTGGGTATACAGGTGCCAACTGTGAGCTGG 

AAGTAGATGAGTGTGCTCCTAGCCCCTGCAAGAACGGAGCGAGCTGCACGGACCTTGA 

GGACAGCTTCTCTTGCACCTGCCCTCCCGGCTTCTATGGCAAGGTCTGTGAGCTGAGCG 

CCATGACCTGTGCAGATGGCCCTTGCTTCAATGGAGGACGATGTTCAGATAACCCTGAC 

GGAGGCTACACCTGCCATTGCCCCTTGGGCTTCTCTGGCTTCAACTGTGAGAAGAAGAT 

GGATCTCTGCGGCTCTTCCCCTTGTTCTAACGGTGCCAAGTGTGTGGACCTCGGCAACT 

CTTACCTGTGCCGGTGCCAGGCTGGCTTCTCCGGGAGGTACTGCGAGGACAATGTGGA 

TGACTGTGCCTCCTCCCCGTGTGCAAATGGGGGCACCTGCCGGGACAGTGTGAACGAC 

TTCTCCTGTACCTGCCCACCTGGCTACACGGGCAAGAACTGCAGCGCCCCTGTCAGCAG 

GTGTGAGCATGCACCCTGCCATAATGGGGCCACCTGCCACCAGAGGGGCCAGCGCTAC 

ATGTGTGAGTGCGCCCAGGGCTATGGCGGCCCCAACTGCCAGTTTCTGCTCCCTGAGCC 

ACCACCAGGGCCCATGGTGGTGGACCTCAGTGAGAGGCATATGGAGAGCCAGGGCGG 

GCCCTTCCCCTGGGTGGCCGTGTGTGCCGGGGTGGTGCTTGTCCTCCTGCTGCTGCTGG 

GCTGTGCTGCTGTGGTGGTCTGCGTCCGGCTGAAGCTACAGAAACACCAGCCTCCACCT 

GAACCCTGTGGGGGAGAGACAGAAACCATGAACAACCTAGCCAATTGCCAGCGCGAG 

AAGGACGTTTCTGTTAGCATCATTGGGGCTACCCAGATCAAGAACACCAACAAGAAGG 

CGGACTTTCACGGGGACCATGGAGCCAAGAAGAGCAGCTTTAAGGTCCGATACCCCAC 

TGTGGACTATAACCTCGTTCGAGACCTCAAGGGAGATGAAGCCACGGTCAGGGATACA 

CACAGCAAACGTGACAGCAAGTGCCAGTCACAGAGCTCTGCAGGAGAAGAGAAGATC 

GCCCCAACACTTAGGGGTGGGGAGATTCCTGACAGAAAAAGGCCAGAGTCTGTCTACT 

CTACTTCAAAGGACACCAAGTACCAGTCGGTGTATGTTCTGTCTGCAGAAAAGGATGA 

GTGTGTTATAGCGACTGAGGTGTAAGATGGAAGCGATGTGGCAAAATTCCCATTTCTCT 

CAAATAAAATTCCAAGGATATAGCCCCGATGAATGCTGCTGAGAGAGGAAGGGAGAG 

GAAACCCAGGGACTGCTGCTGAGAACCAGGTTCAGGCGAAGCTGGTTCTCTCAGAGTT 

AGCAGAGGCGCCCGACACTGCCAGCCTAGGCTTTGGCTGCCGCTGGACTGCCTGCTGG 

TTGTTCCCATTGCACTATGGACAGTTGCTTTGAAGAGTATATATTTAAATGGACGAGTG 

ACTTGATTCATATAGGAAGCACGCACTGCCCACACGTCTATCTTGGATTACTATGAGCC 

AGTCTTTCCTTGAACTAGAAACACAACTGCCTTTATTGTCCTTTTTGATACTGAGATGTG 

TTTTTTTTTTTCCTAGACGGGAAAAAGAAAACGTGTGTTATTTTTTTGGGATTTGTAAAA 

ATATTTTTCATGATATCTGTAAAGCTTGAGTATTTTGTGACGTTCATTTTTTTATAATTT 

AAATTTTGGTAAATATGTACAAAGGCACTTCGGGTCTATGTGACTATATTTTTTTGTAT 

ATAAATGTATTTATGGAATATTGTGCAAATGTTATTTGAGTTTTTTACTGTTTTGTTAAT 

GAAGAAATTCATTTTAAAAATATTTTTCCAAAATAAATATAATGAACTACA 
Figure 3 

MGRRSALALAVVSALLCQVWSSGVFELKLQEFVNKKGLLGNRNCCRGGSGPPCACRTFFR 

VCLKHYQASVSPEPPCTYGSAVTPVLGVDSFSLPDGAGIDPAFSNPIRFPFGFTWPGTFSLIIE 

ALHTDSPDDLATENPERLISRLTTQRHLTVGEEWSQDLHSSGRTDLRYSYRFVCDEHYYGE 

GCSVFCRPRDDAFGHFTCGDRGEKMCDPGWKGQYCTDPICLPGCDDQHGYCDKPGECKC 

RVGWQGRYCDECIRYPGCLHGTCQQPWQCNCQEGWGGLFCNQDLNYCTHHKPCRNGAT 

CTNTGQGSYTCSCRPGYTGANCELEVDECAPSPCKNGASCTDLEDSFSCTCPPGFYGKVCE 

LSAMTCADGPCFNGGRCSDNPDGGYTCHCPLGFSGFNCEKKMDLCGSSPCSNGAKCVDL 

GNSYLCRCQAGFSGRYCEDNVDDCASSPCANGGTCRDSVNDFSCTCPPGYTGKNCSAPVS 

RCEHAPCHNGATCHQRGQRYMCECAQGYGGPNCQFLLPEPPPGPMVVDLSERHMESQGG 

PFPWVAVCAGVVLVLLLLLGCAAVVVCVRLKLQKHQPPPEPCGGETETMNNLANCQREK 

DVSVSIIGATQIKNTNKKADFHGDHGAKKSSFKVRYPTVDYNLVRDLKGDEATVRDTHSK 

RDTKCQSQSSAGEEKIAPTLRGGEIPDRKRPESVYSTSKDTKYQSVYVLSAEKDECVIATEV 



Figure 4 

CGGGCAGAGGTGGAAGAGGGGGGAGCGCCTCAAAGAAGCGATCAGAATAATAAAAGG 
AGGCCGGGCTCTTTGCCTTCTGGAACGCGCGGCTCTTGAAAGGGCTTTTGAAAAGTAGT 
GTTGTTTTCCAGTCGTGCATGCTCCAATCCACGGAGTATATTAGAGCCGGGACGCGGCG 
GCCGCGGGGGCAGCGACGACGGCAGCCTCGGCGGGAGCACCAGCGCTAGCAGCGGCG 
GCGGCGTCCGGAGTGCCCGTGGCGCGCGGCGCAGCGATGCGGTCCCCACGGACGCGCG 
GCCGGCCCGGGCGCCCCCTGAGTCTTCTGCTCGCCCTGCTCTGTGCCCTGCGAGCCAAG 
GTGTGCGGGGCCTCGGGTCAGTTTGAGCTGGAGATCCTGTCCATGCAGAACGTGAATG 
GAGAGCTACAGAATGGGAACTGTTGTGGTGGAGTCCGGAACCCTGGCGACCGCAAGTG 
CACCCGCGACGAGTGTGATACGTACTTCAAAGTGTGCCTCAAGGAGTATCAGTCCCGC 
GTCACTGCCGGGGGACCCTGCAGCTTCGGCTCAGGGTCTACGCCTGTCATCGGGGGTA 
ACACCTTCAATCTCAAGGCCAGCCGTGGCAACGACCGTAATCGCATCGTACTGCCTTTC 
AGTTTCGCCTGGCCGAGGTCCTACACTTTGCTGGTGGAGGCCTGGGATTCCAGTAATGA 
CACTATTCAACCTGATAGCATAATTGAAAAGGCTTCTCACTCAGGCATGATAAACCCTA 
GCCGGCAATGGCAGACACTGAAACAAAACACAGGGATTGCCCACTTCGAGTATCAGAT 
CCGAGTGACCTGTGATGACCACTACTATGGCTTTGGCTGCAATAAGTTCTGTCGTCCCA 
GAGATGACTTCTTTGGACATTATGCCTGTGACCAGAACGGCAACAAAACTTGCATGGA 
AGGCTGGATGGGTCCTGATTGCAACAAAGCTATCTGCCGACAGGGCTGCAGTCCCAAG 
CATGGGTCTTGTAAACTTCCAGGTGACTGCAGGTGCCAGTACGGTTGGCAGGGCCTGT 
ACTGCGACAAGTGCATCCCGCACCCAGGATGTGTCCACGGCACCTGCAATGAACCCTG 
GCAGTGCCTCTGTGAGACCAACTGGGGTGGACAGCTCTGTGACAAAGATCTGAATTAC 
TGTGGGACTCATCAGCCCTGTCTCAACCGGGGAACATGTAGCAACACTGGGCCTGACA 
AATACCAGTGCTCCTGCCCAGAGGGCTACTCGGGCCCCAACTGTGAAATTGCTGAGCA 
TGCTTGTCTCTCTGACCCCTGCCATAACCGAGGCAGCTGCAAGGAGACCTCCTCAGGCT 
TTGAGTGTGAGTGTTCTCCAGGCTGGACTGGCCCCACGTGTTCCACAAACATCGATGAC 
TGTTCTCCAAATAACTGTTCCCATGGGGGCACCTGCGAGGATCTGGTGAATGGATTCAA 
GTGTGTGTGCCCGCCCCAGTGGACTGGCAAGACTTGTCAGTTAGATGGAAATGAGTGC 
GAGGCCAAACCTTGTGTAAATGCCAGATCCTGTAAGAATCTGATTGCCAGCTACTACTG 
TGATTGCCTTCCTGGCTGGATGGGTCAGAACTGTGACATAAATATCAATGACTGCCTTG 
GCCAGTGTCAGAATGACGCCTCCTGTCGGGATTTGGTTAATGGTTATCGCTGTATCTGT 
CCACCTGGCTATGCAGGCGATCACTGTGAGAGAGACATCGATGAGTGTGCTAGCAACC 
CCTGCTTGAATGGGGGTCACTGTCAGAATGAAATCAACAGATTCCAGTGTCTCTGTCCC 
ACTGGTTTCTCTGGAAACCTCTGTCAGCTGGACATCGATTACTGCGAGCCCAACCCTTG 

CCAGAATGGCGCCCAGTGCTACAATCGTGCCAGTGACTATTTCTGCAAGTGCCCCGAG 

GACTATGAGGGCAAGAACTGCTCACACCTGAAAGACCACTGCCGTACCACCACCTGCG 

AAGTGATTGACAGCTGCACTGTGGCCATGGCCTCCAACGACACGCCTGAAGGGGTGCG 

GTATATCTCTTCTAACGTCTGTGGTCCCCATGGGAAGTGCAAGAGCCAGTCGGGAGGC 

AAATTCACCTGTGACTGTAACAAAGGCTTCACCGGCACCTACTGCCATGAAAATATCA 

ACGACTGCGAGAGCAACCCCTGTAAAAACGGTGGCACCTGCATCGATGGCGTTAACTC 

CTACAAGTGTATCTGTAGTGACGGCTGGGAGGGAGCGCACTGTGAGAACAACATAAAT 

GACTGTAGCCAGAACCCTTGTCACTACGGGGGTACATGTCGAGACCTGGTCAATGACT 

TTTACTGTGACTGCAAAAATGGCTGGAAAGGAAAGACTTGCCATTCCCGTGACAGCCA 

GTGTGACGAAGCCACGTGTAATAATGGTGGTACCTGCTATGATGAAGTGGACACGTTT 

AAGTGCATGTGTCCCGGTGGCTGGGAAGGAACAACTTGTAATATAGCTAGAAACAGTA 

GCTGCCTGCCGAACCCCTGTCATAATGGAGGTACCTGCGTGGTCAATGGAGACTCCTTC 

ACCTGTGTCTGCAAAGAAGGCTGGGAGGGGCCTATTTGTACTCAAAATACCAACGACT 

GCAGTCCCCATCCTTGTTACAATAGCGGGACCTGTGTGGACGGAGACAACTGGTATCG 

GTGCGAATGTGCCCCGGGTTTTGCTGGGCCAGACTGCAGGATAAACATCAATGAGTGC 

CAGTCTTCCCCTTGTGCCTTTGGGGCCACCTGTGTGGATGAGATCAATGGCTACCAGTG 

TATCTGCCCTCCAGGACATAGTGGTGCCAAGTGCCATGAAGTTTCAGGGCGATCTTGCA 

TCACCATGGGGAGAGTGATACTTGATGGGGCCAAGTGGGATGATGACTGTAACACCTG 

CCAGTGCCTGAATGGACGGGTGGCCTGCTCCAAGGTCTGGTGTGGCCCGAGACCTTGC 

AGGCTCCACAAAAGCCACAATGAGTGCCCCAGTGGGCAGAGCTGCATCCCGGTCCTGG 

ATGACCAGTGTTTCGTGCGCCCCTGCACTGGTGTTGGCGAATGTCGGTCCTCCAGCCTC 

CAGCCAGTGAAGACCAAGTGCACATCTGACTCCTATTACCAGGATAACTGTGCAAACA 

TCACTTTCACCTTTAACAAAGAGATGATGTCTCCAGGTCTTACCACCGAACACATTTGC 

AGCGAATTGAGGAATTTGAATATCCTGAAGAATGTTTCTGCTGAATATTCGATCTACAT 

AGCCTGTGAGCCTTCCCTGTCAGCAAACAATGAAATACACGTGGCCATGTCTGCAGAA 

GACATCCGGGATGATGGGAACCCTGTCAAGGAAATTACCGATAAAATAATAGATCTCG 

TTAGTAAACGGGATGGAAACAGCTCACTTATTGCTGCGGTTGCAGAAGTCAGAGTTCA 

GAGGCGTCCTCTGAAAAACAGAACAGATTTCCTGGTTCCTCTGCTGAGCTCTGTCTTAA 

CAGTGGCTTGGGTCTGTTGCTTGGTGACAGCCTTCTACTGGTGTGTACGGAAGCGGCGG 

AAGCCCAGCAGCCACACTCACTCCGCCCCCGAGGACAACACCACCAACAATGTGCGGG 

AGCAGCTGAACCAAATCAAAAACCCCATCGAGAAACACGGAGCCAACACGGTCCCCA 

TTAAGGATTACGAGAACAAAAACTCCAAAATGTCAAAAATCAGGACACACAACTCGG 

AAGTGGAGGAGGATGACATGGATAAACACCAGCAGAAAGTCCGCTTTGCCAAACAGC 

CAGTGTATACGCTGGTAGACAGAGAGGAGAAGGCCCCCAGCGGCACGCCGACAAAAC 

ACCCGAACTGGACAAATAAACAGGACAACAGAGACTTGGAAAGTGCCCAGAGCTTGA 

ACCGGATGGAATACATCGTATAGCAGACAGTGGGCTGCCGCCATAGGTAGAGTTTGAG 

GGCACCGCGGGCCG 



Figure 5 

CTGCGGCCGGCCCGCGAGCTAGGCTGGGTTTTTTTTTTTCTCCCCTCCCTCCCCCCTTTT 

TCCATGCAGCTGATCTAAAAGGGAATAAAAGGCTGCGCATAATCATAATAATAAAAGA 

AGGGGAGCGCGAGAGAAGGAGAAAGCCGGGAGGTGGAAGAGGAGGGGGAGCGTCTC 

AAAGAAGCGATCAGAATAATAAAAGGAGGCGGCTCTTTGCCTTCTGGAACGGGCCGCT 

CTTGAAAGGGCTTTTGAAAAGTGGTGTTGTTTTCCAGTCGTGCATGCTCCAATCGGCGG 

AGTATATTAGAGCCGGGACGCGGCGGCCGCAGGGGCAGCGGCGACGGCAGCACGGCG 

GCAGCACCAGCGCGAACAGCAGCGGCGGCGTCCCGAGTGCCCGCGGCGCGCGGCGCA 

GCGATGCGTCCCCACGGACGCGCGGCCGGTCCGGGCGCCCCCTAAGCCTCCTGCTCGC 

CCTGCTCTGTGCCCTGCGAGCCAAGGTGTGTGGGGCCTCGGGTCAGTTCGAGTTGGAG 

ATCCTGTCCATGCAGAACGTGAACGGGGAGCTGCAGAACGGGAACTGCTGCGGCGGC 
GCCCGGAACCCGGGAGACCGCAAGTGCACCCGCGACGAGTGTGAACATACTTCAAAGT 

GTGCCTCAAGGAGTATCAGTCCCGCGTCACGGCCGGGGGGCCCTGCAGCTTCGGCTC 

AGGGTCCACGCCTGTCATCGGGGGCAACACCTTCAACCTCAAGGCCAGCCGCGGCAAC 

GACCGCAACCCATCGTGCTGCCTTTCAGTTTCGCCTGGCCGAGGTCCTATACGTTGCTT 

GTGGAGGCGTGGGATTCCAGTAATGACACCGTTCAACCTGACAGTATTATTGAAAAGG 

CTTCTCACTCGGGCATGATCAACCCCAGCCGGCAGTGGCAGACGCTGAAGCAGAACAC 

GGGCGTTGCCCACTTTGAGTATCAGATCCGCGTGACCTGTGATGACTACTACTATGGCT 

TTGGCTGCAATAAGTTCTGCCGCCCCAGAGATGACTTCTTTGGACACTATGCCTGTG 

ACCAGAATGGCAACAAAACTTGCATGGAAGGCTGGATGGGCCCCGAATGTAACAGAG 

CTATTTGCCGAAAGGCTGCAGTCCTAAGCATGGGTCTTGCAAACTCCCAGGTGACTGCA 

GGTGCCAGTATGGCTGGCAAGGCCTGTACTGTGATAAGTGCATCCCACACCCGGGATG 

CGTCCACGGCATCTGTAATGAGCCCTGGCAGTGCCTCTGTGAGACCAACTGGGGCGGC 

CAGCTCTGTGACAAAGATCTCAATTACTGTGGGACTCATCAGCCGTGTCTCAACGGGG 

GAACTTGTAGCAACACAGGCCCTGACAAATATCAGTGTTCCTGCCCTGAGGGGTATTC 

AGGACCCAACTGTGAAATTGCTGAGCACGCCTGCCTCTCTGATCCCTGTCACAACAGA 

GGCAGCTGTAAGGAGACCTCCCTGGGCTTTGAGTGTGAGTGTTCCCCAGGCTGGACCG 

GCCCCACATGCTCTACAAACATTGATGACTGTTCTCCTAATAACTGTTCCCACGGGGGC 

ACCTGCCAGGACCTGGTTAACGGATTTAAGTGTGTGTGCCCCGCACAGTGGACTGGGA 

AAACGTGCCAGTTAGATGCAAATGAATGTGAGGCCAAACCTTGTGTAAACGCCAAATC 

CTGTAAGAATCTCATTGCCAGCTACTACTGCGACTGTCTTCCCGGCTGGATGGGTCAGA 

ATTGTGACATAAATATTAATGACTGCCTTGGCCAGTGTCAGAATGACGCCTCCTGTCGG 

GATTTGGTTAATGGTTATCGCTGTATCTGTCCACCTGGCTATGCAGGCGATCACTGTGA 

GAGAGACATCGATGAATGTGCCAGCAACCCCTGTTTGAATGGGGGTCACTGTCAGAAT 

GAAATCAACAGATTCCAGTGTCTGTGTCCCACTGGTTTCTCTGGAAACCTCTGTCAGCT 

GGACATCGATTATTGTGAGCCTAATCCCTGCCAGAACGGTGCCCAGTGCTACAACCGT 

GCCAGTGACTATTTCTGCAAGTGCCCCGAGGACTATGAGGGCAAGAACTGCTCACACC 

TGAAAGACCACTGCCGCACGACCCCCTGTGAAGTGATTGACAGCTGCACAGTGGCCAT 

GGCTTCCAACGACACACCTGAAGGGGTGCGGTATATTTCCTCCAACGTCTGTGGTCCTC 

ACGGGAAGTGCAAGAGTCAGTCGGGAGGCAAATTCAGCTGTGACTGTAACAAAGGCTT 

CACGGGAACATACTGCCATGAAAATATTAATGACTGTGAGAGCAACCCTTGTAGAAAC 

GGTGGCACTTGCATCGATGGTGTCAACTCCTACAAGTGCATCTGTAGTGACGGCTGGG 

AGGGGGCCTACTGTGAAACCAATATTAATGACTGCAGCCAGAACCCCTGCCACAATGG 

GGCACGTGTCGCGACCTGGTCAATGACTTCTACTGTGACTGTAAAAATGGGTGGAAAG 

GAAAGACCTGCCACTCACGTGACAGTCAGTGTGATGAGGCCACGTGCAACAACGGTGG 

CACCTGCTATGATGAGGGGGATCTTTTAAGTGCATGTGTCCTGGCGGCTGGGAAGGAA 

CAACCTGTAACATAGCCCGAAACAGTAGCTGCCTGCCCAACCCCTGCCATAATGGGGG 

CACATGTGTGGTCAACGGCGAGTCCTTTACGTGCGTCTGCAAGGAAGGCTGGGAGGGG 

CCCATCTGTGCTCAGAATACCAATGACTGCAGCCCTCATCCCTGTTACAACAGCGGCAC 

CTGTGTGGATGGAGACAACTGGTACCGGTGCGAATGTGCCCCGGGTTTTGCTGGGCCC 

GACTGCAGAATAAACATCAATGAATGCCAGTCTTCACCTTGTGCCTTTGGAGCGACCTG 

TGTGGATGAGATCAATGGCTACCGGTGTGTCTGCCCTCCAGGGCACAGTGGTGCCAAG 

TGCCAGGAAGTTTCAGGGAGACCTTGCATCACCATGGGGAGTGTGATACCAGATGGGG 

CCAAATGGGATGATGACTGTAATACCTGCCAGTGCCTGAATGGACGATCGCCTGCTCA 

AAGGTCTGGTGTGGCCCTCGACCTTGCCTGCTCCACAAAGGGCACAGCGAGTGCCCCA 

GCGGGCAGAGCTGCATCCCCATCCTGGACGACCAGTGCTTCGTCCACCCCTGCACTGGT 

GTGGGCGAGTGTCGGTCTTCCAGTCTCCAGCCGGTGAAGACAAAGTGCACCTCTGACT 

CCTATTACCAGGATAACTGTGCGAACATCACATTTACCTTTAACAAGGAGATGATGTCA 

CCAGGTCTTACTACGGAGCACATTTGCAGTGAATTGAGGAATTTGAATATTTTGAAGAA 

TGTTTCCGCTGAATATTCAATCTACATCGCTTGCGAGCCTTCCCCTTCAGCGAACAATG 

AAATACATGTGGCCATTTCTGCTGAAGATATACGGGATGATGGGAACCCGATCAAG 

GAAATCACTGACAAAATAATCGATCTTGTTAGTAAACGTGATGGAAACAGCTCGCTGA 

TTGCTGCCGTTGCAGAAGTAAGAGTTCAGAGGCGGCCTCTGAAGAACAGAACAGATTT 
CCTTGTTCCCTTGCTGAGCTCTGTCTTAACTGTGGCTTGGATCTGTTGCTTGGTGACGGC 

CTTCTACTGGTGCCTGCGGAAGCGGCGGAAGCCGGGCAGCCACACACACTCAGCCTCT 

GAGGACAACACCACCAACAACGTGCGGGAGCAGCTGAACCAGATAAAAACCCCATTG 

AGAAACATGGGGCCAACACGGTCCCCATCAAGGATTACGAGAACAAGAACTCCAAAT 

GTCTAAAATAAGGACACACAATTCTGAAGTAGAAGAGGACGACATGGACAAACACCA 

GCAGAAAGCCCGGTTTGCCAAGCAGCCGGCGTATACGCTGGTAGACAGAGAAGAGAA 

GCCCCCCAACGGCACGCCGACAACACCCAAACTGGACAAACAAACAGGACAACAGAG 

ACTTGGAAAGTGCCCAGAGCTTAAACCGAATGGAGACATCGTATAGCAGACCGCGGGC 

ACTGCCGCCGCTAGGTAGAGTCTGAGGGCTTGTAGTTCTTTAAACTGTCGTGTCATACT 

CGAGTCTGAGGCCGTTGCTGACTTAGAATCCCTGTGTTAATTTAAGTTTTGACAAGCTG 

GCTTACACTGGCAATGGTAGTTTCTGTGGTTGGCTGGGAAATCGAGTGCCGCATCTCAC 

AGCTATGCAAAAAGCTAGTCAACAGTACCCTGGTTGTGTGTCCCCTTGCAGCCGACAC 

GGTCTCGGATCAGGCTCCCAGGAGCCTGCCCAGCCCCCTGGTCTTTGAGCTCCCACTTC 

TGCCAGATGTCCTAATGGTGATGCAGTCTTAGATCATAGTTTTATTTATATTTATTGACT 

CTTGAGTTGTTTTTGTATATTGGTTTTATGATGACGTACAAGTAGTTCTGTATTTGAAAG 

TGCCTTTGCAGCTCAGAACCACAGCAACGATCACAAATGACTTTATTATTTATTTTTTTA 

ATTGTATTTTTGTTGTTGGGGGAGGGGAGACTTTGATGTCAGCAGTTGCTGGTAAAATG 

AAGAATTTAAAGAAAAAAATGTCAAAAGTAGAACTTTGTATAGTTATGTAAATAATTC 

TTTTTTATTAATCACTGTGTATATTTGATTTATTAACTTAATAATCAAGAGCCTTAAAAC 

ATCATTCCTTTTTATTTATATGTATGTGTTTAGAATTGAAGGTTTTTGATAGCATTGTAA 

GCGTATGGCTTTATTTTTTTGAACTCTTCTCATTACTTGTTGCCTATAAGCCAAAATTAA 

GGTGTTTGAAAATAGTTTATTTTAAAACAATAGGATGGGCTTCTGTGCCCAGAATACTG 

ATGGAATTTTTTTTGTACGACGTCAGATGTTTAAAACACCTTCTATAGCATCACTTAA 

AACACGTTTTAAGGACTGACTGAGGCAGTTTGAGGATTAGTTTAGAACAGGTTTTTTTG 

TTTGTTTGTTTTTTGTTTTTCTGCTTTAGACTTGAAAAGAGACAGGCAGGTGATCTGCTG 

CAGAGCAGTAAGGGAACAAGTTGAGCTATGACTTAACATAGCCAAAATGTGAGTGGTT 

GAATATGATTAAAAATATCAAATTAATTGTGTGAACTTGGAAGCACACCAATCTGACTT 

TGTAAATTCTGATTTCTTTTCACCATTCGTACATAATACTGAACCACTTGTAGATTTGAT 

TTTTTTTTTAATCTACTGCATTTAGGGAGTATTCTAATAAGCTAGTTGAATACTTGAACC 

ATAAAATGTCCAGTAAGATCACTGTTTAGATTTGCCATAGAGTACACTGCCTGCCTTAA 

GTGAGGAAATCAAAGTGCTATTACGAAGTTCAAGATCAAAAAGGCTTATAAAACAGAG 

TAATCTTGTTGGTTCACCATTGAGACCGTGAAGATACTTTGTATTGTCCTATTAGTGTTA 

TATGAACATACAAATGCATCTTTGATGTGTTGTTCTTGGCAATAAATTTTGAAAAGTAA 

TATTTATTAAATTTTTTTGTATGAAAACATGGAACAGTGTGGCTCTTCTGAGCTTACGTA 

GTTCTACCGGCTTTGCCGTGTGCTTCTGCCACCCTGCTGAGTCTGTTCTGGTAATCGGGG 

TATAATAGGCTCTGCCTGACAGAGGGATGGAGGAAGAACTGAAAGGCTTTTCAACCC 

AAAACTCATCTGGAGTTCTCAAAGACCTGGGGCTGCTGTGAAGCTGGAACTGCGGGAG 

CCCCATCTAGGGGAGCCTTGATTCCCTTGTTATTCAACAGCAAGTGTGAATACTGCTTG 

AATAAACACCACTGGATTAATGGAAAAAAAAAAAAAAAA 



Figure 6 


MRSPRTRGRSGRPLSLLLALLCALRAKVCGASGQFELEmSMQNVNGELQNGNCCGGARN 

PGDRKCTRDECDTYFKVCLKEYQSRVTAGGPCSFGSGSTPVIGGNTFNLKASRGNDRNRIV 

LPFSFAWPRSYTLLVEAWDSSNDTVQPDSIIEKASHSGMINPSRQWQTLKQNTGVAHFEYQ 

IRVTCDDYYYGFGCNEFCRPP^DFFGHYACDQNGr^TCMEGWMGPECNRAICRQGCSPK 

HGSCKLPGDCRCQYGWQGLYCDKCIPHPGCVHGICNEPWQCLCETNW'GGQLCDKDLNYC 

GTHQPCLNGGTCSNTGPDKYQCSCPEGYSGPNCEIAEHACLSDPCHNRGSCKETSLGFECE 

CSPGWTGPTCSTr^DCSPrWCSHGGTCQDLWGFKCVCPPQWTGKTCQLDANECEAE? 

CVNAKSCKNLIASYYCDCLPGWMGQNCDEn^^ 

GDHCERDroECASKPCLNGGHCQNEE^QCLCPTGFSGNLCQLDIDYCEPNPCQNGAQCY 
NRASDYFCKCPEDYEGKNCSHL^ 

GKCKSQSGGKFTCDCNKGFTGTYCHENINDCESNPCRNGGTCIDGVNSYKCICSDGWEGA 
YCETNDSIDCSQNPCHNGGTCRDLV]Sro^ 

DEGDAFKCMCPGGWEGTTCNIAKNSSCLPNPCHNGGTCVVNGESFTCVCKEGWEGPICAQ 
NTMDCSPHPCYNSGTCVDGDNWYT^^ 

GYRCVCPPGHSGAKCQEVSGPJ > CITMGSVn > DGAKWT)DDCNTCQCLNGRIACSKVWCGPR 

PCLLHKGHSECPSGQSCIPILDDQCFVHPCTGVGECRSSSLQPVKTKCTSDSYYQDNCANIT 

FTFNKEMMSPGLTTEfflCSELRNLNILKNVSAEYSIYIACEPSPSANNEIHVAIS 

PIECEITDEJIDLVSKSDGNSSLIAAVAEVRVQRRPLKISfRTDFLW 

YWCLRE3lRKPGSHTHSASEDNTTNTSrV/REQLNQIK]S^ 

THNSEVEEDDMDIOHQQKAFJFAKQPAYTLVDREEKPPNGTPTKHPNWTNKQDNRDLESAQ 
SLNRMEYIV 



Figure 7 

GGAGCGGGCGCGCGGCGGCGGCGGGGCCGCGGCGGGCGGGTCGCGGGGGCAATGCGG 

GCGCAGGGCCGGGGCCTTCCCCCCGGCGCTGCTGCTGCTGCTGGCGCTCTGGGTGCAG 

GCGGCGCGGCCCATGGGCTATTTCGAGCTGCAGCTGAGCGCGCTGCGGAACGTGAACG 

GGGAGCTGCTGAGCGGCGCCTGCTGTGACGGCGCGGCCGGACAACGCGCGCGGGGGG 

CTGCGGCCACGACGAGTGCGACACGTACGTGCGCGTGTGCCTTAAGAGTACCAGGCCA 

AGGTGACGCCCACGGGGCCCTGCAGCTACGGCCACGGCGCCACGCCCGTGCTGGGCG 

CAACTCCTTCTACCTGCCGCCGGCGGGCGCTGCGGGGGACCGAGCGCGCGCGCGGCCC 

CGGGCCGGCGCGACCAGGACCCGGGCTTCGTCGTCATCCCCTTCCAGTTCGCCTGGCCG 

CGCTCCTTTACCCTCATCGTGGAGGCCTGGGACTGGGACAACGATACCACCCCGAATG 

AGGAGCTGCTGATCGAGCGAGTGTCGCATGCCGCATGATCAACCCGGAGGACCGCTGG 

AAGAGCCTGCACTTCAGCGGCCACGTGGCGCACCTGGAGCTGCGATCCGCGTGCGCTG 

CGACGAGAACTACTACAGCGCCACTTGCAACAAGTTCTGCCGGCCCCGCAACGACT 

TTTTCGGCCACTACACCTGCGACCAGTACGGCAACAAGGCCTGCATGGACGGCTGGAT 

GGGCAAGGAGTGCAAGGAAGCTGTGTGTAAACAAGGGTGTAATTTGCTCCACGGGGG 

ATGCACCGTGCCTGGGGAGTGCAGTGCAGCTACGGCTGGCAAGGGAGGTTCTGCGATG 

AGTGTGTCCCCTACCCCGGCTGCGTGCATGGCAGTTGTGTGGAGCCCTGGCAGTGCAA 

CTGTGAGACCAACTGGGGCGGCCTGCTCTGTGACAAAGACCTGAACTACTGTGGCAGC 

CACCACCCCTGCACCAACGGAGGCACGTGCATCAACGCCGAGCCTGACCAGTACCGCT 

GCACCTGCCCTGACGGCTACTCGGGCAGGAACTGTGAGAAGGCTGAGCACGCCTGCAC 

CTCCAACCCGTGTGCCAACGGGGGCTCTTGCCATGAGGTGCCGTCCGGCTTCGAATGCC 

ACTGCCCATCGGGCTGGAGCGGGCCCACCTGTGCCCTTGACATCGATGAGTGTGCTTCG 

AACCCGTGTGCGGCCGGTGGCACCTGTGTGGACCAGGTGGACGGCTTTGAGTGCATCT 

GCCCCGAGCAGTGGGTGGGGGCCACCTGCCAGCTGGACGTCAACGACTGTGAAGGGA 

AGCCATGCCTTAACGCTTTTTCTTGCAAAAACCTGATTGGCGGCTATTACTGTGATTGC 

ATCCCGGGCTGGAAGGGCATCAACTGCCATATCAACGTCAACGACTGTCGCGGGCAGT 

GTCAGCATGGGGCACCTGCAAGGACCTGGTGAACGGGTACCAGTGTGTGTGCCCACGG 

GGCTTCGGAGGCCGGCATTGCGAGCTGGAACGAGACAAGTGTGCCAGCAGCCCCTGCC 

ACAGCGGCGGCCTCTGCGAGGACCTGGCCGACGGCTCCACTGCCACTGCCCCCAGGGC 

TTCTCCGGGCCTCTCTGTGAGGTGGATGTCGACCTTTGTGAGCCAAGCCCCTGCCGGAA 

CGGCGCTCGCTGCTATAACCTGGAGGGTGACTATTACTGCGCCTGCCCTGATGACTTTG 

GTGGCAAGAACTGCTCCGTGCCCCGCGAGCCGTGCCCTGGCGGGGCCTGCAGAGTGAT 

CGATGGCTGCGGGTCAGACGCGGGGCCTGGGATGCCTGGCACAGCAGCCTCCGGCGTG 

TGTGGCCCCCATGGACGCTGCGTCAGCCAGCCAGGGGGCAACTTTTCCTGCATCTGTGA 

CAGTGGCTTTACTGGCACCTACTGCCATGAGAACATTGACGACTGCCTGGGCCAGCCCT 

GCCGCAATGGGGGCACATGCATCGATGAGGTGGACGCCTTCCGCTGCTTCTGCCCCAG 

CGGCTGGGAGGGCGAGCTCTGCGACACCAATCCCAACGACTGCCTTCCCGATCCCTGC 
CACAGCCGCGGCCGCTGCTACGACCTGGTCAATGACTTCTACTGTGCGTGCGACGACG 

GCTGGAAGGGCAAGACCTGCCACTCACGCGAGTTCCAGTGCGATGCCTACACCTGCAG 

CAACGGTGGCACCTGCTACGACAGCGGCGACACCTTCCGCTGCGCCTGCCCCCCCGGC 

TGGAAGGGCAGCACCTGCGCCGTCGCCAAGAACAGCAGCTGCCTGCCCAACCCCTGTG 

TGAATGGTGGCACCTGCGTGGGCAGCGGGGCCTCCTTCTCCTGCATCTGCCGGGACGG 

CTGGGAGGGTCGTACTTGCACTCACAATACCAACGACTGCAACCCTCTGCCTTGCTACA 

ATGGTGGCATCTGTGTTGACGGCGTCAACTGGTTCCGCTGCGAGTGTGCACCTGGCTTC 

GCGGGGCCTGACTGCCGCATCAACATCGACGAGTGCCAGTCCTCGCCCTGTGCCTACG 

GGGCCACGTGTGTGGATGAGATCAACGGGTATCGCTGTAGCTGCCCACCCGGCCGAGC 

CGGCCCCCGGTGCCAGGAAGTGATCGGGTTCGGGAGATCCTGCTGGTCCCGGGGCACT 

CCGTTCCCACACGGAAGCTCCTGGGTGGAAGACTGCAACAGCTGCCGCTGCCTGGATG 

GCCGCCGTGACTGCAGCAAGGTGTGGTGCGGATGGAAGCCTTGTCTGCTGGCCGGCCA 

GCCCGAGGCCCTGAGCGCCCAGTGCCCACTGGGGCAAAGGTGCCTGGAGAAGGCCCC 

AGGCCAGTGTCTGGACCACCCTGTGAGGCCTGGGGGGAGTGCGGCGCAGAAGAGCCA 

CCGAGCACCCCCTGCCTGCCACGCTCGGCCACCTGGACAATAACTGTGCCCGCCTCACC 

TTGCATTTCAACCGTGACCACGTGCCCCAGGGCACCACGGTGGGCGCCATTTGCTCCGG 

GATCCGCTCCCTGCCAGCCACAAGGGCTGTGGCACGGGACCGCCTGCTGGTGTTGCTTT 

GCGACCGGGCGTCCTCGGGGGCCAGTGCCGTGGAGGTGGCCGTGTCCTTCAGCCCTGC 

CAGGGACCTGCCTGACAGCAGCCTGATCCAGGGCGCGGCCCACGCCATCGTGGCCGCC 

ATCACCCAGCGGGGAACAGCTCACTGCTCCTGGCTGTCACCGAGGTCAAGGTGGAGAC 

GGTTGTTACGGGCGGCTCTTCCACAGGTCTGCTGGTGCCTGTGCTGTGTGGTGCCTTCA 

GCGTGCTGTGGCTGGCGTGCGTGGTCCTGTGCGTGTGGTGGACACGCAAGCGCAGGAA 

AGAGCGGGAGAGGAGCCGGCTGCCGCGGGAGGAGAGCGCCAACACAGTGGGCCCCGC 

TCAACCCCATCCGCAACCCCATCGAGCGGCCGGGGGGCCACAAGGACGTGCTCTACCA 

GTGCAAGAACTTCACGCCGCCGCCGCGCAGGGCGGACGAGGCGCTGCCCGGGCCGGC 

CGGCCACGCGGCGTCAGGGAGGATGAGGAGGACGAGGATCTGGGCCGCGGTGAGGAG 

GACTCCCTGGAGGCGGAGAAGTTCTCTCACACAAATTCACCAAAGATCCTGGCCGCTC 

GCCGGGGAGGCCGGCCCACTGGGCCTCAGGCCCCAAAGTGGACAACCGCGCGGTCAG 

GAGCATCAATGAGGCCCGCTACGCCGGCAAGGAGTAGGGGCGGCTGCGCTGGGCCGG 

GACCCAGGGCCCTCGGTGGGAGCCATGCCGTCTGCCGGACCCGGAGCCGAGGCATGTG 

CTAGTTTCTTTATTTTGTGTAAAAAAACCACCAAAAACAAAAACCAAATGTTTATTTTC 

TACGTTTCTTTAACCTTGTATAAATTATTCAGTAACTGTCAGGCTGAAAACAATGGAGT 

ATTCTCGGATAGTTGCTATTTTTGTAAAGTTTCCGTGCGTGGCACTCGCTGTATGAAAG 

GAGAGAGCAAAGGGTGTCTGCGTCGTCACCAAATCGTAGCGTTTGTTACCAGAGGTTG 

TGCACTGTTTACAGAATCTTCCTTTTATTCCTCACTCGGGTTTCTCTGTGGCTCCAGGCC 

AAAGTGCCGGTGAGACCCATGGCTGTGTTGGTGTGGCCCATGGCTGTTGGTGGGACC 

CGTGGCTGATGGTGTGGCCTGTGGCTGTCGGTGGGACTCGTGGCTGTCAATGGGACCTG 

TGGCTGTCGGTGGGACCTACGGTGGTCGGTGGGACCCTGGTTATTGATGTGGCCCTGGC 

TGCCGGCACGGCCCGTGGCTGTTGACGCACCTGTGGTTGTTAGTGGGGCCTGAGGTCAT 

CGGCGTGCCCAAGGCCGGCAGGTCAACCTCGCGCTTGCTGGCCAGTCCACCCTGCCTG 

CCGTCTGTGCTTCCTCCTGCCCAGAACGCCCGCTCCAGCGATCTCTCCACTGTGCTTTCA 

GAAGTGCCCTTCCTGCTGCGCAGTTCTCCCATCCTGGGACGGCGGCAGTATTGAAGCTC 

GTGACAAGTGCCTTCACACAGACCCCTCGCAACTGTCCACGCGTGCCGTGGCACCAGG 

CGCTGCCCACCTGCCGGCCCCGGCCGCCCCTCCTCGTGAAAGTGCATTTTTGTAAATGT 

GTACATATTAAAGGAAGCACTCTGTATATTTGATTGAATAATGCCACCAAAAAAAAAA 

AAAAAAAAAAATTCCTGCCC 



Figure 8 

MP^QGRGAFTPALLLLLALWVQAA^ 

GGCGHDECDTYVRVCLKEYQAKVTPTGPCSYGHGATPVLGGNSFYLPPAGAAGDRARAR 
PRAGGDQDPGFWIPFQFAWPRSFTLIVEAWDWDNDTTPNEELLIERVSHAGMINPEDRWK 
SLOTS GHVAHLELQIRVRCDENYY^^ 

KECKEAVCKQGCNLLHGGCTVPGECRCSYGWQGRFCDECVPPGCVHGSCVEPWQCNCET 

NWGGLLCDKDLNYCGSHHPCTNGGTCESTAEPDQYRCTCPDGYSGRNCEKAEHACTSNPC 

ANGGSCHEVPSGFECHCPSGWSGPTCALDIDECASNPCAAGGTCVDQVDGFECICPEQWV 

GATCQLDVNDCEGKRCLNAFSCKmiGGYYCDCrPGWKGlNCHINVNDCRGQCQHGGTCK 

DLVNGYQCVCPRGFGGRHCELERDKCASSPCHSGGLCEDLADGFHCHCPQGFSGPLCEVD 

VDLCEPSPCRNGARCYNLEGDYYCACPDDFGGKNCSVPREPCPGGACRVIDGCGSDAGPG 

MPGTAASGVCGPHGRCVSQPGGNFSCICDSGFTGTYCHENIDDCLGQPCRNGGTCrDEVDA 

FRCFCPSGWEGELCDTNPNDCLPDPCHSRGRCYDLVNDFYCACDDGWKGKTCHSREFQC 

DAYTCSNGGTCYDSGDTFRCACPPGWKGSTCAVAKNSSCLPNPCVNGGTCVGSGASFSCI 

CRDGWEGRTCTHNT]NroCNPLP 

AYGATCVDEINGYRCSCPPGRAGPRCQEVIGFGRSCWSRGTPFPHGSSWVEDCNSCRCLDG 

RRDCSKVWCGWKPCLLAGQPEALSAQCPLGQRCLEKAPGQCLRPPCEAWGECGAEEPPST 

PCLPRSGHLDNNCARLTLHFNRDHVPQGTTVGAICSGIRSLPATRAVARDRLLVLLCDRAS 

SGASAVEVAVSFSPARDLPDSSLIQGAAHAIVAAITQRGNSSLLLAVTEVKVETVVTGGSST 

GLLWVLCGAFSVLWLACVVLCVWWTRKRRKERERSRLPREESANNQWAPLNPIRNPIER 

PGGHKDVLYQCKNFTPPPRRADEALPGPAGHAAVREDEEDEDLGRGEEDSLEAEKFLSHK 

FTKDPGRSPGRPAHWASGPKVDNRAVRSINEARYAGKE 



Figure 9 

MRSPRTRGRPGRPLSLLLALLCALRAKVCGASGQFELEILSMQNVNGELQNGNCCGGVRN 

PGDRKCTRDECDTYFKVCLKEYQSRVTAGGPCSFGSGSTPVIGGNTFNLKASRGNDRNRIV 

LPFSFAWPRSYTLLVEAWDSSNDTIQPDSIIEKASHSGM3KPSRQWQTLKQNTGIAHFEYQIR 

VTCDDHYYGFGCNKFCRPRDDFFGHYACDQNGNKTCMEGWMGPDCKKAICRQGCSPKH 

GSCKXPGDCRCQYGWQGLYCDKCE'HPGCVHGTCNEPWQCLCETNWGGQLCDKDLNYC 

GTHQPCLNRGTCSNTGPDKYQCSCPEGYSGPNCEIAEHACLSDPCHNRGSCKETSSGFECE 

CSPGWTGPTCSTNEDDCSPlSQSrCSHGGTCQDLVNGFKCVCPPQWTGKTCQLDAlSrECEAKPC 

VNARSCKNEIASYYCDCLPGWMGQNCDn^nNDCLGQCQl^ 

DHCEPJ)IDECASNPCLNGGHCQOTINRFQCLCPTGFSGNLCQLDIDYCEPNPCQNGAQCYN 

RASDYFCKCPEDYEGKNCSHLKDHCRTTTCEVmSCTVAMASNDTPEGVRYISSNVCGPHG 

KCKS QSGGKFTCD C>^GFTGT YCBENINDCESNP CKNGGTCID GVNS YKCIC SDGWEGAH 

CENNINDCSQNPCHYGGTCPJDLVIWFYCDCK^GWKGKTCHSRDSQCDEATCNNGGTCY 

DEVDTFKCMCPGGWEGTTCNIARNSSCLPNPCHNGGTCVVNGDSFTCVCKEGWEGPICTQ 

NTNDCSPHDPCYNSGTCVDGDlSrWYRCECAPGFAGPDCRIISnKECQSSPCAFGATCVDEn^GY 

QCICPPGHSGAKCHEVSGRSCITMGRVn^DGAKWDDDCNTCQCLNGRVACSKrVWCGPRPC 

RLHKSHNECPSGQSCIPVLDDQCFVRPCTGVGECRSSSLQPVKTKCTSDSYYQDNCANITFT 

FNKEMMSPGLTTEHICSELPJvnLlSIILKlSrVSAEYSrYIACE 

VKEITDKIIDLVSKRDGNSSLIAAVAEVRVQRRPLKINRTDFLWLLSSVLTV 

YWCVRECRRKPSSHTHSAPEDNTTTSnSTVPsEQLNQIKl^ 

Tm^SEVEEDDMDKHQQKVRFAKQPVYTLVDREEKAPSGTPTKHPNWTNKQDNRDLESAQ 
SLNRMEYTV 



Figure 10 

TCGAGGCGGCGATGCGGGCACGCGGCTGGGGACGCCTGCCTCGGCGGCTGCTGCTGCT 
ACTGGTTCTGTGCGTGCAGGCGACGCGGCCCATGGGCTATTTCGAGCTGCAGCTGAGC 






GCGCTGCGGAACGTGAACGGGGAGCTGCTGAGCGGCGCCTGCTGTGACGGCGACGGC 

CGGACGACGCGCGCGGGGGGCTGCGGCCGCGACGAGTGCGACACGTACGTGCGCGTG 

TGCCTTAAGGAGTACCAGGCCAAGGTGACGCCCACGGGGCCCTGCAGCTACGGCTACG 

GCGCCACGCCCGTGCTGGGTGGCAACTCCTTCTACCTGCCGCCGGCGGGCGCTGCGGG 

GGACCGAGCGCGCGCGCGGTCTCGGACCGGCGGCCACCAGGACCCGGGCCTCGTCGTC 

ATTCCCTTTCAGTTCGCCTGGCCGCGTTCTTTCACCCTCATCGTGGAGGCCTGGGACTG 

GGACAATGACACCACTCCAGATGAGGAGCTGCTGATTGAGCGGGTGTCGCACGCTGGC 

ATGATCAACCCCGAGGACCGCTGGAAGAGCCTGCACTTCAGCGGCCACGTGGCACACC 

TGGAGCTGCAGATGCGAGTGCGCTGTGATGAGAACTACTACAGTGCCACCTGCAACAA 

GTTCTGCCGGCCCCGCAACGACTTCTTTGGCCACTATACCTGCGACCAGTACGGCAACA 

AGGCCTGCATGGATGGCTGGATGGGCAAAGAATGCAAAGAAGCCGTGTGTAAACAAG 

GATGTAATTTGCTCCACGGGGGATGCACTGTGCCTGGGGAGTGCAGGTGCAGCTACGG 

CTGGCAGGGCAAGTTCTGTGACGAGTGTGTCCCCTACCCTGGCTGCGTGCATGGCAGCT 

GTGTGGAGCCCTGGCACTGTGACTGTGAGACCAACTGGGGTGGCCTGCTCTGCGACAA 

AGACCTGAACTACTGTGGCAGCCACCACCCCTGTGTCAACGGGGGTACCTGCATCAAT 

GCTGAGCCTGACCAATACCTCTGCGCCTGCCCAGATGGCTACTTGGGCAAGAACTGTG 

AGCGGGCTGAGCACGCCTGTGCCTCCAACCCGTGTGCCAATGGGGGCTCTTGCCACGA 

AGTGCCATCTGGCTTTGAATGCCACTGTCCGTCAGGATGGAGCGGACCCACCTGTGCG 

CTCGACATTGATGAGTGTGCCTCTAACCCATGTGCAGCGGGTGGTACCTGCGTGGATCA 

GGTGGACGGCTTCGAGTGCATCTGCCCGGAGCAGTGGGTGGGGGCTACTTGCCAGCTG 

GACGCCAATGAGTGTGAAGGGAAGCCGTGCCTTAATGCTTTTTCTTGCAAAAACCTGAT 

TGGCGGCTATTACTGTGATTGCCTCCCGGGCTGGAAGGGCATCAACTGCCAAATCAAC 

ATCAACGATTGTCATGGGCAGTGTCAGCATGGGGGCACCTGCAAGGACCTGGTCAATG 

GGTACCAGTGTGTGTGCCCGCGGGGCTTTGGAGGTCGCCATTGCGAACTAGAGTACGA 

CAAGTGTGCCAGCAGCCCCTGCCGCCGGGGTGGCATCTGCGAGGACCTGGTGGATGGC 

TTCCGCTGCCACTGCCCACGGGGCCTCTCTGGGCTGCACTGTGAGGTGGACATGGATCT 

CTGTGAACCAAGCCCCTGCCTCAACGGTGCTCGCTGCTACAACCTTGAGGGTGACTACT 

ACTGCGCCTGCCCAGAAGACTTTGGTGGCAAGAACTGCTCAGTGCCCAGGGACACATG 

CCCTGGCGGGGCATGTAGAGTGATCGATGGCTGCGGGTTCGAGGCAGGGTCCAGGGCA 

CGCGGTGTCGCACCCTCTGGTATATGTGGCCCTCACGGGCACTGCGTTAGCCTGCCTGG 

GGGAAACTTCTCCTGCATCTGTGACAGCGGCTTCACAGGCACCTACTGCCATGAAAAC 

ATTGACGACTGCATGGGCCAGCCCTGCCGCAACGGGGGCACGTGCATTGACGAAGTGG 

ACTCCTTCCGCTGCTTCTGCCCCAGTGGCTGGGAAGGAGAACTCTGTGACATCAATCCC 

AACGACTGCCTCCCCGACCCCTGCCACAGCCGCGGCCGCTGCTATGACCTGGTCAATG 

ACTTCTACTGTGCCTGTGACGATGGCTGGAAGGGCAAGACCTGCCACTCACGCGAGTT 

CCAGTGTGACGCCTACACCTGCAGCAACGGTGGCACATGCTATGACAGCGGCGACACC 

TTCCGCTGCGCGTGCCCTCCGGGCTGGAAGGGCAGCACCTGCACCATCGCCAAGAACA 

GCAGCTGTGTGCCCAATCCCTGTGTGAATGGAGGCACCTGCGTGGGTAGCGGAGACTC 

TTTCTCCTGCATCTGCCGGGATGGCTGGGAGGGCCGCACCTGCACACATAACACCAAT 

GACTGCAACCCTCTGCCCTGCTATAACGGAGGCATCTGTGTTGATGGCGTCAACTGGTT 

CCGCTGCGAGTGTGCGCCTGGCTTTGCGGGTCCTGACTGCCGTATCAACATTGATGAGT 

GCCAGTCCTCGCCCTGTGCCTACGGAGCCACGTGTGTGGATGAGATCAACGGGTACCG 

CTGCAGCTGCCCACCAGGTCGTTCTGGCCCCAGGTGCCAGGAAGTGGTCATATTCACG 

AGGCCCTGCTGGTCCCGGGGAATGTCCTTCCCGCATGGGAGTTCCTGGATGGAAGACT 

GCAACAGCTGCCGCTGCCTGGATGGCCACCGGGATTGTAGCAAGGTATGGTGCGGATG 

GAAGCCTTGCCTGCTCTCTGGTCAGCCCAGCGATCCGAGTGCCCAGTGCCCCCCAGGG 

CAGCAATGTCAGGAGAAGGCCGTGGGTCAGTGCTTGCAGCCACCCTGTGAGAACTGGG 

GGGAGTGTACAGCGGAGGAGCCTCTGCCACCCAGCACCCCCTGTCAGCCACGGAGCAG 

TCATTTGGACAACAACTGTGCCCGACTCACACTGCGCTTCAACCGTGATCAAGTGCCTC 

AGGGCACCACCGTGGGCGCTATCTGCTCTGGAATCCGAGCCTTGCCTGCCACGAGGGC 

GGCGGCACACGACCGCCTCCTCCTGCTGCTTTGTGATCGAGCATCCTCGGGGGCCAGTG 

CTGTGGAGGTGGCTATGTCTTTCAGCCCTGCAAGGGACCTGCCTGACAGCAGCCTGATC 
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CAGAGCACAGCCCACGCCATCGTGGCTGCTATCACTCAGAGAGGAAATAGCTCACTGC 

TGCTGGCTGTCACCGAGGTCAAGGTGGAAACAGTTGTTATGGGTGGCTCTTCCACAGGT 

CTGTTGGTGCCCGTGCTGTGCAGCGTGTTCAGTGTGCTGTGGCTCGCCTGTGTGGTTAT 

CTGCGTATGGTGGACACGAAAGCGCAGGAAAGAACGTGAGAGGAGCCGGCTACCACG 

GGATGAGAGCACCAACAACCAGTGGGCCCCGCTCAATCCCATCCGCAACCCCATTGAG 

CGGCCAGGCGGCAGCGGTCTGGGAACTGGGGGCCACAAGGACATACTCTACCAGTGC 

AAAAACTTCACACCGCCGCCCCGCAGGGCAGGCGAGGCACTGCCCGGGCCAGCTGGCC 

ATGGGGCTGGTGGGGAGGACGAGGAGGATGAAGAGCTGAGCCGTGGAGATGGGGACT 

CCCCAGAGGCAGAGAAGTTCATCTCACACAAGTTCACCAAAGACCCCAGCTGCTCCCT 

CGGAAGGCCAGCCTGCTGGGCTCCAGGGCCCAAAGTGGACAACCGCGCCGTCAGAAG 

TACCAAGGACGTGCGCCGTGCTGGCAGGGAGTAGCCAGCCACCAGGCTGGCACCCAG 

AACCCTTGCTGGCACCACGCTGCCTGCCGGACCATAGGAGGCCAAGGCCGTGTGCATA 

GTTTCTTTATTTTGTGTAAAAAACAAAACCAAAACCAAAAAACAAATGTTTATTTTTTA 

CGTTTCTTTAACCTTGTATAAATTATTCAACGGCTGTCAGGCGGAAAACAACGGAGTAT 

TCTCGGATCATTGCTATTTTTGTAAAGTTTCCGCGTCCGCACGCACTGTGGCAGGAGAG 

CAGGGCGTGTGTATGTGTGTGTGTGTGTGTCCTCACC 



Figure 11 

MLCDKDLNYCGSHHPCVNGGTCINAEPDQYLCACPDGYLGKNCERAEHACASNPCANGG 

SCHEVPSGFECHCPSGWSGPTCALDIDECASNPCAAGGTCVDQVDGFECICPEQWVGATC 

QLDANECEGKPCmAFSCKmiGGYYCDCIPGWKGINCQITINDCHGQVSAWGHLQGPVN 

GYQCVCPRGFGVRHCELEYDKCASSPCRRGGICEDLVDGFRCHCPRGLSGLHCEVDMDLC

EPSPCFNGVRCYNLEGDYYCACPEDFGGKNCSVPRDTCPGGACRVIDGCGFEAGSRARGV 

APSGICGPHGHCVSLPGGNFSCICDSGFTGTYCHElSnDDCMGQPCRNGGTCIDEVDSFRCFC 

PSGWEGELCDINPNDCLPDPCHSRGRCYDLVNDFYCACDDGWKGKTCHSREFQCDAYTC 

SNGGTCYDSGDTFRCACPPGWKGSTCTIAKNSSCVPNPCVNGGTCVGSGDSFSCICRDGWE

GRTCTHNTNDCNPLPCYNGGICVDGVHWFACECAPGF 



Figure 12 

GAAGGCCATGGTCTCCCCACGGATGTCCGGGCTCCTCTCCCAGACTGTGATCCTAGCGC 

TCATTTTCCTCCCCCAGACACGGCCCGCTGGCGTCTTCGAGCTGCAGATCCACTCTTTC 

GGGCCGGGTCCAGGCCCTGGGGCGCCGCGGTCCCCCTGCAGCGCCCGGCTCCCCTGCC 

GCCTCTTCTTCAGAGTCTGCCTGAAGCCTGGGCTCTCAGAGGAGGCCGCCGAGTCCCCG 

TGCGCCCTGGGCGCGGCGCTGAGTGCGCGCGGACCGGTCTACACCGAGCAGCCCGGAG 

CGCCCGCGCCTGATCTCCCACTGCCCGACGGGCTCTTGCAGGTGCCCTTCCGGGACG 

CCTGGCCTGGCACCTTCTCTTTCATCATCGAAACCTGGAGAGAGGAGTTAGGAGACCA 

GATTGGAGGGCCCGCCTGGAGCCTGCTGGCGCGCGTGGCTGGCAGGCGGCGCTTGGCA 

GCCGGAGGCCCGTGGGCCCGGGCATTCAGCGCGCAGGCGCCTGGGAGCTGCGCTTCTC 

GTACCGCGCGCGCTGCGAGCCGCCTGCCGTCGGGACCGCGTGCACGCGCCTCTGCCGT 

CCGCGCAGCGCCCCCTCGCGGTGCGGTCCGGGACTGCGCCCCTGCGCACCGCTCGAGG 

ACGAATGTGAGGCGCCGCTGGTGTGCCGAGCAGGCTGCAGCCCTGAGCATGGCTTCTG 

TGAACAGCCCGGTGAATGCCGATGCCTAGAGGGCTGGACTGGACCCCTCTGCACGGTC 

CCTGTCTCCACCAGCAGCTGCCTCAGCCCCAGGGGCCCGTCCTCTGCTACCACCGGATG 

CCTTGTCCCTGGGCCTGGGCCCTGTGACGGGAACCCGTGTGCCAATGGAGGCAGCTGT 

AGTGAGACACCCAGGTCCTTTGAATGCACCTGCCCGCGTGGGTTCTACGGGCTGCGGT 

GTGAGGTGAGCGGGGTGACATGTGCAGATGGACCCTGCTTCAACGGCG 
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GCTTGTGTGTCGGGGGTGCAGACCCTGACTCTGCCTACATCTGCCACTGCCCACCTGGT 

TTCCAAGGCTCCAACTGTGAGAAGAGGGTGGACCGGTGCAGCCTGCAGCCATGCCGCA 

ATGGCGGACTCTGCCTGGACCGGGCCACGCCCTGCGCTGCCGCTGCCGCGCCGGCTTC 

GCGGGTCCTCGCTGCGAGCACGACCTGGACGACTGCGCGGGCCGCGCCTGCGCTAACG 

GCGGCACGTGTGTGGAGGGCGGCGGCGCGCACCGCTGCTCCTGCCGCTGGGCTTCGGC 

GGCCGCGACTGCCGCGAGCGCGCGGACCCGTGCGCCGCGCGCCCCTGTGCTCACGGC 

GGCCGCTGCTACGCCCACTTCTCCGGCCTCGTCTGCGCTTGCGCTCCCGGCTACATGGG 

AGCGCGGTGTGAGTTCCCAGTGCACCCCGACGGCGCAAGCGCCTTGCCCGCGGCCCCG 

CCGGGCCTCAGGCCCGGGGACCCTCAGCGCTACCTTTTGCCTCCGGCTCTGGGACTGCT 

CGTGGCCGCGGGCGTGGCCGGCGCTGCGCTCTTGCTGGTCCACGTGCGCCGCCGTGGC 

CACTCCCAGGATGCTGGGTCTCGCTTGCTGGCTGGGACCCCGGAGCCGTCAGTCCACG 

CACTCCCGGATGCACTCAACAACCTAAGGACGCAGGAGGGTTCCGGGGATGGTCCGG 

CTCGTCCGTAGATTGGAATCGCCCTGAAGATGTAGACCCTCAAGGGATTTATGTCATAT 

CTGCTCCTTCCATCTACGCTCGGGAGGTAGCGACGCCCCTTTTCCCCCCGCTACACACT 

GGGCGCGCTGGGCAGAGGCAGCACCTGCTTTTTCCCTACCCTTCCTCGATTCTGTCCGT 

GAAATGAATTGGGTAGAGTCTCTGGAAGGTTTTAAGCCCATTTTCAGTTCTAACTTACT 

TTCATCCTATTTTGCATCCCTCTTATCGTTTTGAGCTACCTGCCATCTTCTCTTT 



Figure 13 

MVSPI^SGLLSQTVILALIFLPQTRPAGVFELQIHSFGPGPGPGAPRSPCSARLPCRLFFRVC 

LBa'GLSEEAAJESPCAmAALSARGPVYTEQPGAPAPDLPLPDGLLQVPFRDAWPGTFSFnE 

TWPJBELGDQIGGPAWSLLARVAGRPvPXAAGGPWARDIQRAGAWELRFSYRARCEPPAVG 

TACTRLCRPRSAPSRCGPGLRPCAPLEDECEAPLVCRAGCSPEHGFCEQPGECRCLEGWTG 

PLCTVPVSTSSCLSPRGPSSATTGCLVPGPGPCDGNPCANGGSCSETPRSFECTCPRGFYGLR 

CEVSGVTCADGPCFNGGLCVGGADPDSAYICHCPPGFQGSNCEKRVDRCSLQPCRNGGLC 

LDLGHALRCRCRAGFAGPRCEHDLDDCAGRACANGGTCVEGGGAHRCSCALGFGGRDCR 

ERADPCAARPCAHGGRCYAHFSGLVCACAPGYMGARCEFPVHPDGASALPAAPPGLRPG 

DPQRYLLPPALGLLVAAGVAGAALLLVHVRRRGHSQDAGSRLLAGTPEPSVHALPDALNN 

LRTQEGSGDGPSSSVDWNRPEDVDPQGrYVISAPSIYAREVATPLFPPLHTGRAGQRQHLLF 

PYPSSILSVK 

Figure 14 

AAACCGGAACGGGGCCCAACTTCTGGGGCCTGGAGAAGGGAAACGAAGTCCCCCCCG 

GTTTCCCGAGGTGCCTTTCCTCGGGCATCCTTGGTTTCGGCGGGACTTCGCAGGGCGGA 

TATAAAGAACGGCGCCTTTGGGAAGAGGCGGAGACCGGCTTTAAAGAAAGAAGTCTTG 

GTCCTGCGGCTTGGGCGAGGCAAGGGCGAGGCAGGGCGCTTTCTGCCGACGCTCCCCG 

TGGCCCTACGATCCCCCGCGCGTCCGCCGCTGTTCTAAGGAGAGAAGTGGGGGCCCCC 

CAGGCTCGCGCGTGGAGCGAAGCAGCATGGGCAGTCGGTGCGCGCTGGCCCTGGCGT 

GCTCTCGGCCTTGCTGTGTCAGGTCTGGAGCTCTGGGGTGTTCGAACTGAAGCTGCAGG 

AGTTCGTCAACAAGAAGGGGCTGCTGGGGAACCGCAACTGCTGCCGCGGGGGCGCGG 

GGCCACCGCCGTGCGCCTGCCGACCTTCTTCCGCGTGTGCCTCAAGCACTACCAGGCCA 

GCGTGTCCCCCGAGCCGCCCTGCACCTACGGCAGCGCCGTCACCCCCGTGCTGGGCGT 

CGACTCCTTCAGTCTGCCCGACGGCGGGGGCGCCGACTCCGCGTTCAGCAACCCCATC 

CGCTTCCCCTTCGGCTTCACCTGGCCGGGCACCTTCTCTCTGATTATTGAAGCTCTCC 

ACACAGATTCTCCTGATGACCTCGCAACAGAAAACCCAGAAAGACTCATCAGCCGCCT 

GGCCACCCAGAGGCACCTGACGGTGGGCGAGGAGTGGTCCCAGGACCTGCACAGCAG 

CGGCCGCACGGACCTCAAGTACTCTACCGCTTCGTGTGTGACGAACACTACTACGGAG 

AGGGCTGCTCCGTTTTCTGCCGTCCCCGGGACGATGCCTTCGGCCACTTCACCTGTGGG 

GAGCGTGGGGAGAAAGTGTGCAACCCTGGCTGGAAAGGGCCCTACTGCACAGAGCCG 
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ATCTGCCTGCCTGGATGTGATGAGCAGCATGGATTTTGTGACAAACCAGGGGAATGCA 

AGTGCAGAGTGGGCTGGCAGGGCCGGTACTGTGACGAGTGTATCCGCTATCCAGGCTG 

TCTCCATGGCACCTGCCAGCAGCCCTGGCAGTGCAACTGCCAGGAAGGCTGGGGGGGC 

CTTTTCTGCAACCAGGACCTGAACTACTGCACACACCATAAGCCCTGCAAGAATGGAG 

CCACCTGCACCAACACGGGCCAGGGGAGCTACACTTCTCTTGCCGGCCTGGGTACACA 

GGTGCCACCTGCGAGCTGGGGATTGACGAGTGTGACCCCAGCCCTTGTAAGAACGGAG 

GGAGCTGCACGGATCTCGAGAACAGCTACTCCTGTACCTGCCCACCCGGCTTCTACGG 

CAAAATCTGTGAATTGAGTGCCATGACCTGTGCGGACGGCCCTTGCTTTAACGGGGGTC 

GGTGCTCAGACAGCCCCGATGGAGGGTACAGCTGCCGCTGCCCCGTGGGCTACTCCGG 

CTTCAACTGTGAGAAGAAAATTGACTACTGCAGCTCTTCACCCTGTTCTAATGGTGCCA 

AGTGTGTGGACCTCGGTGATGCCTACCTGTGCCGCTGCCAGGCCGGCTTCTCGGGGAG 

GCACTGTGACGACAACGTGGACGACTGCGCCTCCTCCCCGTGCGCCAACGGGGGCACC 

TGCCGGGATGGCGTGAACGACTTCTCCTGCACCTGCCCGCCTGGCTACACGGGCAGGA 

ACTGCAGTGCCCCCGTCAGCAGGTGCGAGCACGCACCCTGCCACAATGGGGCCACCTG 

CCACCAGAGGGGCACGGCTATGTGTGCGAATGTGCCCGAAGCTACGGGGGTCCCAACT 

GCCAGTTCCTGCTCCCCGAGCTGCCCCCGGGCCCAGCGGTGGTGGACCTCACTGAGAA 

GCTAGAGGGCCAGGGCGGGCCATTCCCCTGGGTGGCGTGTGCGCCGGGGTCATCCTTG 

TCCTCATGCTGCTGCTGGGCTGTGCCGCTGTGGTGGTCTGCGTCCGGCTGAGGCTGCAG 

AAGCACCGGCCCCCAGCCGACCCCTGCCGGGGGGAGACGGAGACCATGAACAACCTG 

GCAACTGCCAGCGTGAGAAGGACATCTCAGTCAGCATCATCGGGGCCACGCAGATCAA 

GAACACCAACAAAAGGCGGACTTCCACGGGGACCACAGCGCCGACAAGAATGGCTTC 

AAGGCCCGCTACCCAGCGGTGGACATAACCTCGTGCAGGACCTCAAGGGTGACGACAC 

CGCCGTCAGGGACGCGCACAGCAAGCGTGACACCAGTGCCAGCCCCAGGGCTCCTCAG 

GGGAGGAGAAGGGGACCCCGACCACACTCAGGGGTGGAGAAGCATCGAAAGAAAAA 

GGCCGGACTCGGGCTGTTCAACTTCAAAAGACACCAAGTACCAGTCGGTGTACGTCAT 

ATCCGAGGAGAAGGATGAGTGCGTCATAGCAACTGAGGTGTAAAATGGAAGTGAGAT 

GGCAAGACTCCCGTTCTCTTAAAATAAGTAAAATTCCAAGGATATATGCCCCAACGAA 

TGCTGCTGAAGAGGAGGGAGGCCTCGTGGACTGCTGCTGAGAAACCGAGTTCAGACCG 

AGCAGGTTCTCCTCCTGAGGTCCTCGACGCCTGCCGACAGCCTGTCGCGGCCCGGCCGC 

CTGCGGCACTGCCTTCCGTGACGTCGCCGTTGCACTATGGACAGTTGCTCTTAAGAGAA

TATATATTTAAATGGGTGAACTGAATTACGCCTAAGAAGCATGCACTGCCTGAGTGTAT 

ATTTTGGATTCTTATGAGCCAGTCTTTTCTTGAATTAGAAACACAAACACTGCCTTTATT 

GTCCTTTTTGATACGAAGATGTGCTTTTTCTAGATGGAAAAGATGTGTGTTATTTTTTGG 

ATTTGTAAAAATATTTTTCATGATATCTGTAAAGCTTGAGTATTTTGTGATGTTCGTTTT 

TTATAATTTAAATTTTGGTAAATATGTACAAAGGCACTTCGGGTCTATGTGACTATATT 

TTTTTGTATATAAATGTATTTATGGAATATTGTGCCAATGTTATTTGAGTTTTTTACTGT 

TTTGTTAATGAAGAAATTCCTTTTTAAAATATTTTTCCAAAATAAATTTTATGAGGAATT 

C 

Figure 15 

MGSRCALALAVLSALLCQVWSSGVFELKLQEFVNKKGLLGNRNCCRGGAGPPPCACRTFF 

RVCLKEIYQASVSPEPPCTYGSAVTPVLGVDSFSLPDGGGADSAFSNPIRFPFGFTWPGTFSLI 

IEALHTDSPDDLATENPERLISRLATQRHLTVGEEWSQDLHSSGRTDLKYSYRFVCDEHYY

GEGCSVFCRPRDDAFGHFTCGERGEKVCNPGWKGPYCTEPICLPGCDEQHGFCDKPGECK 

CRVGWQGRYCDECIRYPGCLHGTCQQPWQCNCQEGWGGLFCNQDLNYCTHHKPCKNGA 

TCTNTGQGSYTCSCRPGYTGATCELGIDECDPSPCKNGGSCTDLENSYSCTCPPGFYGKICE 

LSAMTCADGPCFNGGRCSDSPDGGYSCRCPVGYSGFNCEKKIDYCSSSPCSNGAKCVDLG 

DAYLCRCQAGFSGRHCDDNVDDCASSPCANGGTCRDGVNDFSCTCPPGYTGRNCSAPVSR 

CEHAPCHNGATCHQRGHGYVCECARSYGGPNCQFLLPELPPGPAWDLTEKLEGQGGPFP 

WAVCAGVILVLMLLLGCAAVWCVRLRLQKHRPPADPCRGETETMNN^^ 
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VSHGATQIKNTmKADFHGDHSADKNGFKARYPAVDYmVQDLKGDDTAVRDAHSKilD 
TKCQPQGSSGEEKGTPTTLRGGEASERKRPDSGCSTSKDTKYQSVYVISEEKDECVIATEV 



Figure 16 

ATGGCGGCAGCGTCCCGGAGCGCCTCTGGCTGGGCGCTACTGCTGCTGGTGGCACTTT 

GGCAGCAGCGCGCGGCCGGCTCCGGCGTCTTCCAGCTGCAGCTGCAGGAGTTCATCAA 

CGAGCGCGGCGTACTGGCCAGTGGGCGGCCTTGCGAGCCCGGCTGCCGGACTTTCTTC 

CGCGTCTGCCTTAAGCACTTCCAGGCGGTCGTCTCGCCCGGACCCTGCACCTTCGGGAC 

GTCTCCACGCCGGTATTGGGCACCAACTCCTTCGCTGTCCGGGACGACAGTAGCGGCG 

GGGGGCGCAACCCTCTCCAACTGCCCTTCAATTTCACCTGGCCGGGTACCTTCTCGCT 

CATCATCGAAGCTTGGCACGCGCCAGGAGACGACCTGCGGCCAGAGGCCTTGCCACCA 

GATGCACTCATCAGCAAGATCGCCATCCAGGGCTCCCTAGCTGTGGGTCAGAACTGGT 

TATTGGATGAGCAAACCAGCACCCTCACAAGGCTGCGCTACTCTTACCGGGTCATCTGC 

AGTGACAACTACTATGGAGACAACTGCTCCCGCCTGTGCAAGAAGCGCAATGACCACT 

TCGGCCACTATGTGTGCCAGCCAGATGGCAACTTGTCCTGCCTGCCCGGTTGGACTGGG 

GAATATTGCCAACAGCCTATCTGTCTTTCGGGCTGTCATGAACAGAATGGCTACTGCA 

GCAAGCCAGCAGAGTGCCTCTGCCGCCCAGGCTGGCAGGGCCGGCTGTGTAACGAATG 

CATCCCCCACAATGGCTGTCGCCACGGCACCTGCAGCACTCCCTGGCAATGTACTTGTG 

ATGAGGGCTGGGGAGGCCTGTTTTGTGACCAAGATCTCAACTACTGCACCCACCACTC 

CCCATGCAAGAATGGGGCAACGTGCTCCAACAGTGGGCAGCGAAGCTACACCTGCACC 

TGTCGCCCAGGCTACACTGGTGTGGACTGTGAGCTGGAGCTCAGCGAGTGTGACAGCA 

ACCCCTGTCGCAATGGAGGCAGCTGTAAGGACCAGGAGGATGGCTACCACTGCCTGTG 

TCCTCCGGGCTACTATGGCCTGCATTGTGAACACAGCACCTTGAGCTGCGCCGACTCCC 

CCTGCTTCAATGGGGGCTCCTGCCGGGAGCGCAACCAGGGGGCCAACTATGCTTGTGA 

ATGTCCCCCCAACTTCACCGGCTCCAACTGCGAGAAGAAAGTGGACAGGTGCACCAGC 

AACCCCTGTGCCAACGGGGGACAGTGCCTGAACCAGGTCCAAGCCGCATGTGCCGCTG 

CCGTCCTGGATTCACGGGCACCTACTGTGAACTCCACGTCAGCGACTGTGCCCGTAACC

CTTGCGCCCACGGTGGCACTTGCCATGACCTGGAGAATGGGCTCATGTGCACCTGCCC 

TGCCGGCTTCTCTGGCCGACGCTGTGAGGTGCGGACATCCATCGATGCCTGTGCCTCGA 

GTCCCTGCTTCAACAGGGCCACCTGCTACACCGACCTCTCCACAGACACCTTTGTGTGC 

AACTGCCCTTATGGCTTTGTGGGCAGCCGCTGCGAGTTCCCCGTGGGCTTGCCGCCCAG 

CTTCCCCTGGGTGGCCGTCTCGCTGGGTGTGGGGCTGGCAGTGCTGCTGGTACTGCTGG 

GCATGGTGGCAGTGGCTGTGCGGCAGCTGCGGCTTCGACGGCCGGACGACGGCAGCAG 

GGAAGCCATGAACAACTTGTCGGACTTCCAGAAGGACAACCTGATTCCTGCCGCCAGC 

TTAAAAACACAAACCAGAAGAAGGAGCTGGAAGTGGACTGTGGCCTGGACAAGTCCA 

ACTGTGGCAACAGCAAAACCACACATTGGACTATAATCTGGCCCCAGGGCCCCTGGGG 

CGGGGGACCATGCCAGGAAGTTTCCCCACAGTGACAAGAGCTTAGGAGAGAAGGCGC 

CACTGCGGTTACACAGTGAAAAGCCAGAGTGCGGATATCAGCGATATGCTCCCCCAGG 

GACTCCATGTACCAGTCTGTGTGTTTGATATCAGAGGAGAGGAATGAATGTGTCATTGC 

CACGGAGGTATAA 


Figure 17 


MAAASRSASGWALLLLVALWQQRAAGSGVFQLQLQEFINERGVLASGRPCEPGCRTFFRV

CLKHFQAWSPGPCTFGTVSTPVLGTNSFAVRDDSSGGGRNPLQLPFNFTWPGTFSLIIEAW 

HAPGDDLRPEALPPDALISKIAEQGSLAVGQNWLLDEQTSTLTEERYSYRVICSDNYYGDN 

CSRLCKKE^XDHFGHYVCQPDGNLSCLPGWTGEYCQQPICLSGCHEQNGYCSKPAECLCRP 

GWQGRLCNECIPHNGCRHGTCSTPWQCTCDEGWGGLFCDQDLNYCTHHSPCKNGATCSN 

SGQRSYTCTCRPGYTGVDCELELSECDSNPCRNGGSCKTJQEDGYHCLCPPGYYGLHCEHS 

TLSCADSPCFNGGSCRERNQGANYACECPPNFTGSNCEKKVDRCTSNPCANGGQCLNR
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GPSRMCRCI^GFTGTYCELHVSDCAiySIPCAHGGTCHDLENGLMCTCPAGFSGRRCEVRTS 
]DACASSPCF1<IP^TCYTT)LSTDTFVCNCPYGFVGSRCEFPVGLPPSFPWVAVSLGVGLAVLL 

VLLGMVAVAVRQLRERRPDDGSREAMlSn^ 

KSNCGKQQNHTLDYNLAPGPLGRGTMPGKEPHSDKSLGEK^PLRLHSEKPECRISAICSPR 
DSMYQSVCLISEERNECVIATEV 

Figure 18 

CTCGCAGGCTAGGAACCCGAGGCCAAGAGCTGCAGCCAAAGTCACTTGGGTGCAGTGT 

ACTCCCTCACTAGCCCGCTCGAGACCCTAGGATTTGCTCCAGGACACGTACTTAGAGCA 

GCCACCGCCCAGTCGCCCTCACCTGGATTACCTACCGAGGCATCGAGCAGCGGAGTTT 

TTGAGAAGGCGACAAGGGAGCAGCGTCCCGAGGGAATCAGCTTTTCAGGAACTCGGCT 

GGCAGACGGGACTTGCGGGAGAGCGACATCCCTAACAAGCAGATTCGGAGTCCCGGA 

GTGGAGAGGACACCCCAAGGGATGACGCCTGCGTCCCGGAGCGCCTGTCGCTGGGCGT 

ACTGCTGCTGGCGGTACTGTGGCCGCAGCAGCGCGCTGCGGGCTCCGGCATCTTCCAG 

CTGCGGCTGCAGGAGTTCGTCAACCAGCGCGGTATGCTGGCCAATGGGCAGTCCTGCG 

AACCGGGCTGCCGGACTTTCTTCCGCATTTGCCTTAAGCACTTCCAGGCAACCTTCTCC 

GAGGGACCCTGCACCTTTGGCAATGTCTCCACGCCGGTATTGGGCACCAACTCCTTCGT 

CGTCAGGGACAAGAATAGCGGCAGTGGTCGCAACCCTCTGCAGTTGCCCTTCAATTTC 

ACCTGGCCGGGAACCTTCTCACTCAACATCCAAGCTTGGCACACACCGGGAGACGACC 

TGCGGCCAGAGACTTCGCCAGGAAACTCTCTCATCAGCCAAATCATCATCCAAGGCTC 

TCTTGCTGTGGGTAAGATTTGGCGAACAGACGAGCAAAATGACACCCTCACCAGACTG 

AGCTACTCTTACCGGGTCATCTGCAGTGACAACTACTATGGAGAGAGCTGTTCTCGCCT 

ATGCAAGAAGCGCGATGACCACTTCGGACATTATGAGTGCCAGCCAGATGGCAGCCTG 

TCCTGCCTGCCGGGCTGGACTGGGAAGTACTGTGACCAGCCTATATGTCTTTCTGGCTG 

TCATGAGCAGAATGGTTACTGCAGCAAGCCAGATGAGTGCATCTGCCGTCCAGGTTGG 

CAGGGTCGCCTGTGCAATGAATGTATCCCCCACAATGGCTGTCGTCATGGCACCTGCA 

GCATCCCCTGGCAGTGTGCCTGCGATGAGGGATGGGGAGGTCTGTTTTGTGACCAAGA 

TCTCAACTACTGTACTCACCACTCTCCGTGCAAGAATGGATCAACGTGTTCCAACAGTG 

GGCCAAAGGGTTATACCTGCACCTGTCTCCCAGGCTACACTGGTGAGCACTGTGAGCT 

GGGACTCAGCAAGTGTGCCAGCAACCCCTGTCGAAATGGTGGCAGCTGTAAGGACCAG 

GAGAATAGCTACCACTGCCTGTGTCCCCCAGGCTACTATGGCCAGCACTGTGAGCATA

GTACCTTGACCTGTGCGGACTCACCCTGCTTCAATGGGGGCTCTTGCCGGGAGCGCAAC 

CAGGGGTCCAGTTATGCCTGCGAATGCCCCCCCAACTTTACCGGCTCTAACTGTGAGAA 

GAAAGTAGACAGGTGTACCAGCAACCCGTGTGCCAATGGAGGCCAGTGCCTGAACAG 

AGGTCCAAGCCGAACCTGCCGCTGCCGGCCTGGATTCACAGGCACCCACTGTGAACTG 

CACATCAGCGATTGTGCCCGAAGTCCCTGTGCCCACGGGGGCACTTGCCACGATCTGG 

AGAATGGGCCTGTGTGCACCTGCCCCGCTGGCTTCTCTGGCAGGCGCTGCGAGGTGCG 

GATAACCCACGATGCCTGTGCCTCCGGACCCTGCTTCAATGGGGCCACCTGCTACACTG 

GCCTCTCCCCAAACAACTTCGTCTGCAACTGTCCTTATGGCTTTGTGGGCAGCCGCTGC 

GAGTTTCCCGTGGGCTTGCCACCCAGCTTCCCCTGGGTAGCTGTCTCGCTGGGCGTGGG 

GCTAGTGGTACTGCTGGTGCTGCTGGTCATGGTGGTAGTGGCTGTGCGGCAGCTGCGGC 

TTCGGAGGCCCGATGACGAGAGCAGGGAAGCCATGAACAATCTGCAGACTTCCAGAA 

GGACAACCTAATCCCTGCCGCCCAGCTCAAAAACACAAACCAGAAGAAGGAGCTGGA 

GTGGACTGTGGTCTGGACAAGTCCAATTGTGGCAAACTGCAGAACCACACATTGGACT 

ACAATCTAGCCCCGGGACTCCTAGGACGGGGCAGCATGCCTGGGAAGTATCCTCACAG

TGACAAGAGCTTAGGAGAGAAGTGCCACTTCGGTTACACAGTGAGAAGCCAGAGTGTC 

GAATATCAGCCATTTGCTCTCCCAGGGACTCTATGTACCAATCAGTGTGTTTGATATCA 

GAAGAGAGGAACGAGTGTGTGATTGCCACAGAGGTATAAGGCAGAGCCTACTCAGAC 

ACCCAGCTCCGGCCCAGCAGCTGGGCCTTCCTTCTGCATTGTTTACATTGCATCCTGT 

ATGGGACATCTTTAGTATGCACAGTGCTGCTCTGCGGAGGAGGAGGGAATGGCATGAA 

CTGAACAGACGTGAACCCGCCAAGAGTTGCACCGGCTCTGCACACCTCCAGGAGTCTG 




CCTGGCTTCAGATGGGCAGCCCCGCCAAGGGAACAGAGTTGAGGAGTTAGAGGAGCAT 

CAGTTGAGCTGATATCTAAGGTGCCTCTCGAACTTGGACTTGCTCTGCCAACAGTGGTC 

ATCATGGAGCTCTTGACTGTTCTCCAGAGAGTGGCAGTGGCCCTAGTGGGTCTTGGCGC 

TGCTGTAGCTCCTGTGGGCATCTGTATTTCCAAAGTGCCTTTGCCCAGACTCCATCC 

TCACAGCTGGGCCCAAATGAGAAAGCAGAGAGGAGGCTTGCAAAGGATAGGCCTCCC 

GCAGGCAGAACGCCTTGGAGTTTGGCATTAAGCAGGAGCTACTCTGCAGGTGAGGAAA 

GCCCGAGGAGGGGACACGTGTGCTCCTGCCTCCAACCCCAGCAGGTGGGGTGCCACCT 

GCAGCCTCTAGGCAAGAGTTGGTCCTTCCCCTGGTCCTGGTGCCTCTGGGCTCATGTGA 

ACAGATGGGCTTAGGGCACGCCCCTTTTGCCAGCCAGGGGTACAGGCCTCACTGGGGA 

GCTCAGGGCCTTCATGCTAAACTCCCAATAAGGGAGATGGGGGGAAGGGGGCTGTGC 

CTAGGCCCTTCCCTCCCTCACACCCATTTTTGGGCCCTTGAGCCTGGGCTCCACCAGTG 

CCCACTGTTGCCCCGAGACCAACCTTGAAGCCGATTTTCAAAAATCAATAATATGAGGT 

TTTGTTTTGTAGTTTATTTTGGAATCTAGTATTTTGATAATTTAAGAATCAGAAGCACTG 

GCCTTTCTACATTTTATAACATTATTTTGTATATAATGTGTATTTATAATATGAAACAGA 

TGTGTACATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Figure 19 

mtpasrsacrwallllavlwpqqraagsgifqlrlqefvnqrgmlangqscepgcrtffr

iclkotqAtfsegpctfgnvstpvlgtosfvvrdknsgsgrotlqlpfottwpgtfslniqa 

whtpgddlrpetspgnslisqinqgslavgkiwrtdeqndtltrlsysyrvicsdnyygesc 

srlckxrddhfghyecqpdgslsclpgwtgkycdqpiclsgcheqngycskpdecicrpg 

wqgrlcneciphngcrhgtcsipwqcacdegwgglfcdqdlnycthhspckngstcsns 

gpkgytctclpgytgehcelglskcasnpcrnggsckdqensyhclcppgyygqhcehst 

ltcadspcfnggscrernqgssyacecppnftgsncekkvdrctsnpcanggqclnrgpsr 

tcrcrpgftgthcelhisdcarspcahggtchdlengpvctcpagfsgrrcevrithdaca 

sgpcfngatcytglspnnfvcncpygfvgsrcefpvglppsfpwvavslgvglwllvllv 

mvwawqlrlrrpddesream>mlsdfqki)ne^ 

gklqnhtldynlapgllgrgsmpgkyphsde^lgekwlrlhsekpecrisaicsprdsmy 
qsvcliseernecviatev 
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Figure 20 




Western blot analysis of Notch 2 expression in human germ cell tumour derived cell 
lines. 

Western blot probed with an antibody specific for the intracellular portion of human
NOTCH2 and visualised using chemiluminescence. Lanes from left to right 1: BeWo, 2:
TERA-1, 3: 833KE, 4: 2102 Ep 2A6, 5: 2102 Ep 4D3, 6: NTERA2/D1 after 8 days of exposure
to retinoic acid, 7: NTERA2/D1 EC cells. Molecular weight markers are indicated on the
right in kDa. The Notch 2 protein product is visualised at approximately 100 kDa.
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Figure 22 

AAACCCACTCCACCTTACTACCAGACAACCTTAGCCAAACCATTTACCCAAATAAAGT 

ATAGGCGATAGAAATTGAAACCTGGCGCAATAGATATAGTACCGCAAGGGAAAGATG 

AAAAATTATAACCAAGCATAATATAGCAAGGACTAACCCCTATACCTTCTGCATAATG 

AATTAACTAGAAATAACTTTGCAAGGAGAGTCAAAGCTAAGGCCGCCGAAACCAGGCG 

AGCTACCTAAGAACAGCTAAAAGAGCACACCCGTCTATGTAGCAAAATAGTGGGAAG 

ATTTATAGGTAGAGGCGACAAACCTACCGAGCCTGGTGATAGCTGGTTGTCCAAGATA 

GAATCTTAGTTCAACTTTAAATTTGCCCACAGAACCCTCTAAATCCCCTTGTAAATTTA 

ACTGTTAGTCCAAAGAGGAACAGCTCTTTGGACACTAGGAAAAAACCTTGTAGAGAGA 

GTGTCAGCCCAATTCCACACTTTTCCACATGTTGGATGGCCTTGGAGTGGTAGCCATAA 

GCATTTTTGGAATTCAACTAAAAACTGAAGGATCCTTGAGGACGGCAGTACCTGGCAT 

ACCTACACAGTCAGCGTTCAACAAGTGTTTGCAAAGGTACATTGGGGCACTGGGGGCA 

CGAGTGATCTGTGACAATATCCCTGGTTTGGTGAGCCGGCAGCGGCAGCTGTGCCAGC 

GTTACCCAGACATCATGCGTTCAGTGGGCGAGGGTGCCCGAGAATGGATCCGAGAGTG 

TCAGCACCAATTCCGCCACCACCGCTGGAACTGTACCACCCTGGACCGGGACCACACC 

GTCTTTGGCCGTGTCATGCTCAGAAGTAGCCGAGAGGCAGCTTTTGTATATGCCATCTC 

ATCAGCAGGGGTGATCCACGCTATTACTCGCGCCTGTAGCCAGGGTGAACTGAGTGTG 

TGCAGCTGTGACCCCTACACCCGTGGCCGACACCATGACCAGCGTGGGACTTTTGACT 

GGGGTGGCTGCAGTGACAACATCCACTACGGTGTCCGTTTTGCCAAGGCCTTCGTGGAT 

GCCAAGGAGAAGAGGCTTAAGGATGCCCGGGCCCTCATGAACTTACATAATAACCGCT 

GTGGTCGCACGGCTGTGCGGCGGTTTGTCAAGCTGGAGTGTAAGTGCCATGGCGTGAG 

TGGTTCCTGTACTCTGCGCACCTGCTGGCGTGCACTCTCAGATTTCCGCCGCACAGGTG 

ATTACCTGCGGCGACGCTATGATGGGGCTGTGCAGGTGATGGCCACCCAAGATGGTGC 

CAACTTCACCGCAGCCCGCCAAGGCTATCGCCGTGCCACCCGGAGTGATCTTGTCTACT 

TTGACAACTCTCCAGATTACTGTGTCTTGGACAAGGCTGCAGGTTCCCTAGGCACTGCA 

GGCCGTGTCTGCAGCAAGACATCAAAAGGAACAGACGGTTGTGAAATCATGTGCTGTG 

GCCGAGGGTACGACACAACTCGAGTCACCCGTGTTACCCAGTGTGAGTGCAAATTCCA 

CTGGTGCTGTGCTGTACGGTGCAAGGAATGCAGAAATACTGTGGACGTCCATACTTGC 

AAAGCCCCCAAGAAGGCAGAGTGGCTGGACCAGACCTGAACACACAGATACCTCACT 

CATCCCTCCAATTCAAGCCTCTCAACTCAAAAGCACAAGATCCTTGCATGCACACCTTC 

CTCCACCCTCCACCCTGGGCTGCTACCGCTTCTATTTAAGGATGTAGAGAGTAATCCAT 

AGGGACCATGGTGTCCTGGCTGGTTCCTTAGCCCTGGGAAGGAGTTGTCAGGGGATAT 

AAGAAACTGTGCAAGCTCCCTGATTTCCCGCTCTGGAGATTTGAAGGGAGAGTAGAAG 

AGATAGGGGGTCTTTAGAGTGAAATGAGTTGCACTAAAGTACGTAGTTGAGGCTCCTT 

TTTTCTTTCCTTTGCACCAGCTTCCCGACACTTCTTGGTGTGCAAGAGGAAGGGTACCT 

GTAGAGAGCTTCTTTTTGTTTCTACCTGGCCAAAGTTAGATGGGACAAAGATGAATGGC 

ATGTCCCTTCTCTGAAGTCCGTTTGAGCAGAACTACCTGGTACCCCGAAAGAAAAATCT 

TAGGCTACCACATTCTATTATTGAGAGCCTGAGATGTTAGCCATAGTGGACAAGGTTCC 

ATTCACATGCTCATATGTTTATAAACTGTGTTTTGTAGAAGAAAAAGAATCATAACAAT 

ACAAACACACATTCATTCTCTCTTTTTCTCTCTACCATTCTCAACCTGTATTGGACAGCA 

CTGCCTCTTTTGCTTACTTGCTGCCTGTTCAAACTGAGGTGGAATGCAGTGGTTCCCATG 

CTTAACAGATCATTAAAACACCCTAGAACACTCCTAGGATAGATTAATGT 
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Figure 26 

ACCGCAGGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGCACGGTTTCGTGGGGA 

CCCAGGCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTTGAGTCCTTCTGAG 

ATGATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGGTAGCGGCGG 

CTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTCAATTCC 

AACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTCTG 

CAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGA 

CAACTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGC 

GCTAGTCCCACCCGCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGC 

GCCGAAAACGCTGCATGCGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGG 

AATATGTGTGTCTTCTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTG 

AAAGCTTTGGTAATGATCATAGCACCTTGGATGGGTATTCCAGAAGAACCACCTTGTCT 

TCAAAAATGTATCACACCAAAGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACT 

GTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGTCCAAGATCTGTAAACCTGTCCTG 

AAAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCTCTCATGGACTAGAAATA 

TTCCAGCGTTGTTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGAAAGATCACCATCA 

AGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGAGACACTAAACCAGCTATCCAAATG 

CAGTGAACTCCTTTTATATAATAGATGCTATGAAAACCTTTTATGACCTTCATCAACTC 

AATCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCATTCCAATAACACCTTCCAA 

AAACCTGGAGTGTAAGAGCTTTGTTTCTTTATGGAACTCCCCTGTGATTGCAGTAAATT 

ACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAATGCAATGAAACTTTTAATTA 

TTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTTTTGTACACATT 

GATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATTTCAGCTTA 

TAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCAAGGAT 

CTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAAAATACTAGCTTATTTT 

CTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGA 

AATAAAATTTAACATTTAATATCATGAAATGTTATAA 






Figure 27 



agaaagcgggagcccgcggcgagcgtagcgcaagtccgctccctaggcatcgctgcgc 

tggcagcgattcgctgtctcttgtgagtcaggggacaacgcttcgGggcaactgtgag 

tgcgcgtgtgggggacctcgattctcttcagatctcgaggattcggtccggggacgtct 

cctgatcccctactaaagrcgcctgctaactttgaaaaggagcactgtgtcctgcaaagt 

ttgacacataaaggataggaaaagagaggagagaaaagcaactgagttgaaggagaa 

ggagctgatgcgggcctcctgatcaattaagaggagagttaaaccgccgagatcccgg 

cgggaccaaggaggtgcggggcaagaaggaacggaagcggtgcgatccacagggctg 

ggttttcttgcaccttgggtcacgcctccttggcgagaaagcgcctcgcatttgattgc 

ttccagttattgcagaacttcctgtcctggtggagaagcgggtctcgcttgggttccgc 

taatttctgtcctgaggcgtgagactgagttcatagggtcctgggtccccgaaccagga 

agggttgagggaacacaatctgcaagcccccgcgacccaagtgaggggccccgtgttg 

gggtcctccctccctttgcattcccacccctccgggctttgcgtcttcctggggaccccc 

tcgccgggagatggccgcgttgatgcggagcaaggattcgtcctgctgcctgctccta 

ctggccgcggtgctgatggtggagagctcacagatcggcagttcgcgggccaaactca 

actccatcaagtcctctctgggcggggagacgcctggtcaggccgccaatcgatctgc 

gggcatgtaccaaggactggcattcggcggcagtaagaagggcaaaaacctggggca 

ggcctacccttgtagcagtgataaggagtgtgaagttgggaggtattgccacagtccc 

caccaaggatcatcggcctgcatggtgtgtcggagaaaaaagaagcgctgccaccga 

gatggcatgtgctgccccagtacccgctgcaataatggcatctgtatcccagttactga 

aaggatcttaacccctcacatcccggctctggatggtactcggcacagagatcgaaac 

cacggtcattactcaaaccatgacttgggatggcagaatctaggaagaccacacacta 

agatgtcacatataaaagggcatgaaggagacccctgcctacgatcatcagactgcat 

tgaagggttttgctgtgctcgtcatttctggaccaaaatctgcaaaccagtgctccatc 

agggggaagtctgtaccaaacaacgcaagaagggttctgatgggctggaaattttcca 

GCGTTGCGACTGTGCGAAGGGCCTGTCTTGCAAAGTATGGAAAGATGCCACCTACTCC 

TCCAAAGCCAGACTCCATGTGTGTCAGAAAATTTGATCACCATTGAGGAACATCATCA 

ATTGCAGACTGTGAAGTTGTGTATTTAATGCATTATAGCATGGTGGAAAATAAGGTTCA 

GATGCAGAAGAATGGCTAAAATAAGAAACGTGATAAGAATATAGATGATCACAAAAA 

GGGAGAAAGAAAACATGAACTGAATAGATTAGAATGGGTGACAAATGCAGTGCAGCC 

AGTGTTTCCATTATGCAACTTGTCTATGTAAATAATGTACACATTTGTGGAAAATGCTA 

TTATTAAGAGAACAAGCACACAGTGGAAATTACTGATGAGTAGCATGTGACTTTCCAA 

GAGTTTAGGTTGTGCTGGAGGAGAGGTTTCCTTCAGATTGCTGATTGCTTATACAAATA 

ACCTACATGCCAGATTTCTATTCAACGTTAGAGTTTAACAAAATACTCCTAGAATAACT 

TGTTATACAATAGGTTCTAAAAATAAAATTGCTAAACAAGAAATGAAAACATGGAGCA 

TTGTTAATTTACAACAGAAAATTACCTTTTGATTTGTAACACTACTTCTGCTGTTCAATC 

AAGAGTCTTGGTAGATAAGAAAAAAATCAGTCAATATTTCCAAATAATTGCAAAATAA 

TGGCCAGTTGTTTAGGAAGGCCTTTAGGAAGACAAATAAATAACAAACAAACAGCCAC 

AAATACTTTTTTTTCAAAATTTTAGTTTTACCTGTAATTAATAAGAACTGATACAAGAC 

AAAAACAGTTCCTTCAGATTCTACGGAATGACAGTATATCTCTCTTTATCCTATGTGAT 

TCCTGCTCTGAATGCATTATATTTTCCAAACTATACCCATAAATTGTGACTAGTAAAAT 

ACTTACACAGAGCAGAATTTTCACAGATGGCAAAAAAATTTAAAGATGTCCAATATAT 

GTGGGAAAAGAGCTAACAGAGAGATCATTATTTCTTAAAGATTGGCCATAACCTGTAT 

TTTGATAGAATTAGATTGGTAAATACATGTATTCATACATACTCTGTGGTAATAGAGAC 

TTGAGCTGGATCTGTACTGCACTGGAGTAAGCAAGAAAATTGGGAAAACTTTTTCGTTT 

GTTCAGGTTTTGGCAACACATAGATCATATGTCTGAGGCACAAGTTGGCTGTTCATCTT 

TGAAACCAGGGGATGCACAGTCTAAATGAATATCTGCATGGGATTTGCTATCATAATA 

TTTACTATGCAGATGAATTCAGTGTGAGGTCCTGTGTCCGTACTATCCTCAAATTATTTA 

TTTTATAGTGCTGAGATCCTCAAATAATCTCAATTTCAGGAGGTTTCACAAAATGGACT 
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CCTGAAGTAGACAGAGTAGTGAGGTTTCATTGCCCTCTATAAGCTTCTGACTAGCCAAT 

GGCATCATCCAATTTTCTTCCCAAACCTCTGCAGCATCTGCTTTATTGCCAAAGGGCTA 

GTTTCGGTTTTCTGCAGCCATTGCGGTTAAAAAATATAAGTAGGATAACTTGTAAAACC 

TGCATATTGCTAATCTATAGACACCACAGTTTCTAAATTCTTTGAAACCACTTTACTACT 

TTTTTTAAACTTAACTCAGTTCTAAATACTTTGTCTGGAGCACAAAACAATAAAAGGTT 

ATCTTATAGTCGTGACTTTAAACTTTTGTAGACCACAATTCACTTTTTAGTTTTCTTTTA

CTTAAATCCCATCTGCAGTCTCAAATTTAAGTTCTCCCAGTAGAGATTGAGTTTGAGCC 

TGTATATCTATTAAAAATTTCAACTTCCCACATATATTTACTAAGATGATTAAGACTTA 

CATTTTCTGCACAGGTCTGCAAAAACAAAAATTATAAACTAGTCCATCCAAGAACCAA 

AGTTTGTATAAACAGGTTGCTATAAGCTTGGTGAAATGAAAATGGAACATTTCAATCA 

AACATTTCCTATATAACAATTATTATATTTACAATTTGGTTTCTGCAATATTTTTCTTAT 

GTCCACCCTTTTAAAAATTATTATTTGAAGTAATTTATTTACAGGAAATGTTAATGAGA 

TGTATTTTCTTATAGAGATATTTCTTACAGAAAGCTTTGTAGCAGAATATATTTGCAGCT 

ATTGACTTTGTAATTTAGGAAAAATGTATAATAAGATAAAATCTATTAAATTTTTCTCC 

TCTAAAAACTGAATTCAAAGC 



Figure 28 



ACACACAGGCGGCGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCT 

GCTGTGCCTGCTGCTGGCGGCGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCG 

ACCTCGGCTCCAGTCAAGCCCGGCCCGGCTCTCAGCTACCCGCAGGAGGAGGCCACCC 

TCAATGAGATGTTCCGCGAGGTTGAGGAACTGATGGAGGACACGCAGCACAAATTGCG 

CAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTAAAGCATCATCAGAAGT 

GAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGACACGAAGGTT 

GGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGACTG 

GACAAATGGTCTTTTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAG 

GAGCCACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCC 

AGCTTCCAGTACACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACA 

GTGAGTGCTGTGGAGACCAGCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAG 

GGGCAGCAATGGGACCATCTGTGACAACCAGAGGGACTGCCAGCCGGGGCTGTGCTGT 

GCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGCACACCCCTGCCCGTGGAGGGCGAGCT 

TTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCACCTGGGAGCTAGAGCCTGATG 

GAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGCCCCACAGCCACAGC 

CTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGGGAGATCC 

TGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG 

CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCC 

TGCGGCTGCCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTG 

TGGGTAGATGTGCAATAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTG 

GGCTGACCAGGCTJCTTCCTACATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGC 

ATGAGGTGTTGTGCATTTGTTCAGCTCCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGG 

TGCTTGGGAGAGTCAGGCAGGGTTAAACTGCAGGAGCAGTTTGCCACCCCTGTCCAGA 

TTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCGTTTGTTCTACATGGCTTTGAT 

AATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTGTCCCTCTGATTGGTTTTGGG 

GAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATGCAACAAA 

TGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGCA 

GATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAG 
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ctcctacctctgtgccagggcagcattttcatatccaagatcaattccctctctcagca 

cagcctggggagggggtcattgttctcctcgtccatcagggatctcagaggctcagag 

actgcaagctgcttgcccaagtcacacagctagtgaagaccagagcagtttcatctgg 

ttgtgactctaagctcagtgctctctccactaccccacaccagccttggtgccaccaaa 

agtgctccccaaaaggaaggagaatgggatttttcttttgaggcatgcacatctggaa 

ttaaggtcaaactaattctcacatccctctaaaagtaaactactgttaggaacagcagt 

gttctcacagtgtggggcagccgtccttctaatgaagacaatgatattgacactgtccc 

tctttggcagttgcattagtaactttgaaaggtatatgactgagcgtagcatacaggtt 

aacctgcagaaacagtacttaggtaattgtagggcgaggattataaatgaaatttgca 

aaatcacttagcagcaactgaagacaattatcaaccacgtggagaaaatcaaaccgag 

cagggctgtgtgaaacatggttgtaatAtgcgactgcgaacactgaactctacgccac 

TCCACAAATGATGTTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTT 

CTTAAAGTTTAAAGTTGCACATGATTGTATAAGCATGCTTTCTTTGAGTTTTAAATTATG 

TATAAACATAAGTTGCATTTAGAAATCAAGCATAAATCACTTCAACTGCTCTTCT 



Figure 29 



GACAAACAGACGACGTGCTGAGCTGCCAGCTTAGTGGAAGCTCTGCTCTGGGTGGAGA 

GCAGCCTCGCTTTGGTGACGCACAGTGCTGGGACCCTCCAGGAGCCCCGGGATTGAAG 

GATGGTGGCGGCCGTCCTGCTGGGGCTGAGCTGGCTCTGCTCTCCCCTGGGAGCTCTGG 

TCCTGGACTTCAACAACATCAGGAGCTCTGCTGACCTGCATGGGGCCCGGAAGGGCTC 

ACAGTGCCTGTCTGACACGGACTGCAATACCAGAAAGTTCTGCCTCCAGCCCCGCGAT 

GAGAAGCCGTTCTGTGCTACATGTCGTGGGTTGCGGAGGAGGTGCCAGCGAGATGCCA 

TGTGCTGCCCTGGGACACTCTGTGTGAACGATGTTTGTACTACGATGGAAGATGCAACC 

CCAATATTAGAAAGGCAGCTTGATGAGCAAGATGGCACACATGCAGAAGGAACAACT 

GGGCACCCAGTCCAGGAAAACCAACCCAAAAGGAAGCCAAGTATTAAGAAATCACAA 

GGCAGGAAGGGACAAGAGGGAGAAAGTTGTCTGAGAACTTTTGACTGTGGCCCTGGAC 

TTTGCTGTGCTCGTCATTTTTGGACGAAAATTTGTAAGCCAGTCCTTTTGGAGGGACAG 

GTCTGCTCCAGAAGAGGGCATAAAGACACTGCTCAAGCTCCAGAAATCTTCCAGCGTT 

GCGACTGTGGCCCTGGACTACTGTGTCGAAGCCAATTGACCAGCAATCGGCAGCATGC 

TCGATTAAGAGTATGCCAAAAAATAGAAAAGCTATAAATATTTCAAAATAAAGAAGAA 

TCCACATTGCATTTGAG 



Figure 30 

ATGGGGCTCTGGGCGCTGTTGCCTGGCTGGGTTTCTGCTACGCTGCTGCTGGCGCTGGC 

CGCTCTGCCCGCAGCCCTGGCTGCCAACAGCAGTGGCCGATGGTGGGGTATTGTGAAC 

GTAGCCTCCTCCACGAACCTGCTTACAGACTCCAAGAGTCTGCAACTGGTACTCGAGCC 

CAGTCTGCAGCTGTTGAGCCGCAAACAGCGGCGCCTGATACGCCAAAATCCGGGGATC 

CTGCACAGCGTGAGTGGGGGGCTGCAGAGTGCCGTGCGCGAGTGCAAGTGGCAGTTCC 

GGAATCGCCGCTGGAACTGTCCCACTGCTCCAGGGCCCCACCTCTTCGGCAAGATCGTC 

AACCGAGGCTGTCGAGAAACGGCGTTTATCTTCGCTATCACCTCCGCCGGGGTCACCC 

ATTCGGTGGCGCGCTCCTGCTCAGAAGGTTCCATCGAATCCTGCACGTGTGACTACCGG 

CGGCGCGGCCCCGGGGGCCCCGACTGGCACTGGGGGGGCTGCAGCGACAACATTGACT 

TCGGCCGCCTCTTCGGCCGGGAGTTCGTGGACTCCGGGGAGAAGGGGCGGGACCTGCG 

CTTCCTCATGAACCTTCACAACAACGAGGCAGGCCGTACGACCGTATTCTCCGAGATG 

CGCCAGGAGTGCAAGTGCCACGGGATGTCCGGCTCATGCACGGTGCGCACGTGCTGGA 

TGCGGCTGCCCACGCTGCGCGCCGTGGGCGATGTGCTGCGCGACCGCTTCGACGGCGC 
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CTCGCGCGTCCTGTACGGCAACCGCGGCAGCAACCGCGCTTCGCGAGCGGAGCTGCTG 

CGCCTGGAGCCGGAAGACCCGGCCCACAAACCGCCCTCCCCCCACGACCTCGTCTACT 

TCGAGAAATCGCCCAACTTCTGCACGTACAGCGGACGCCTGGGCACAGCAGGCACGGC 

AGGGCGCGCCTGTAACAGCTCGTCGCCCGCGCTGGACGGCTGCGAGCTGCTCTGCTGC 

GGCAGGGGCCACCGCACGCGCACGCAGCGCGTCACCGAGCGCTGCAACTGCACCTTCC 

ACTGGTGCTGCCACGTCAGCTGCCGCAACTGCACGCACACGCGCGTACTGCACGAGTG 

TCTGTGA 



Figure 31

MGLWALLPGWVSATLLLALAALPAALAANSSGRWWGIVNVASSTNLLTDSKSLQLVLEPS
LQLLSRKQRRIJRQNPGILHSVSGGLQSAVREC^^ 

PvETAFIFAITSAGVTHSVARSCSEGSIESCTCDYPJ^GPGGPDWHWGGCSDNIDFGRLFGRE 
FVDSGEKGRDLRFLMNLHNr^^ 

GDVLRDRFDGASRVLYGmGSNRASRAE]XRLEPEDPAHKPPSPHDLVYFEKSPNFCTYSG 
RLGTAGTAGRACNSSSPALDGCELLCCGRGHRTRTQRVTERCNCTFHWCCHVSCRNCTHT 
RVLHECL 



Figure 32 

AGCAGAGCGGACGGGCGCGCGGGAGGCGCGCAGAGCTTTCGGGCTGCAGGCGCTCGC 

TGCCGCTGGGGAATTGGGCTGTGGGCGAGGCGGTCCGGGCTGGCCTTTATCGCTCGCT 

GGGCCCATCGTTTGAAACTTTATCAGCGAGTCGCCACTCGTCGCAGGACCGAGCGGGG 

GGCGGGGGCGCGGCGAGGCGGCGGCCGTGACGAGGCGCTCCCGGAGCTGAGCGCTTC 

TGCTCTGGGCACGCATGGCGCCCGCACACGGAGTCTGACCTGATGCAGACGCAAGGGG 

GTTAATATGAACGCCCCTCTCGGTGGAATCTGGCTCTGGCTCCCTCTGCTCTTGACCTG 

GCTCACCCCCGAGGTCAACTCTTCATGGTGGTACATGAGAGCTACAGGTGGCTCCTCCA 

GGGTGATGTGCGATAATGTGCCAGGCCTGGTGAGCAGCCAGCGGCAGCTGTGTCACCG 

ACATCCAGATGTGATGCGTGCCATTAGCCAGGGCGTGGCCGAGTGGACAGCAGAATGC 

CAGCACCAGTTCCGCCAGCACCGCTGGAATTGCAACACCCTGGACAGGGATCACAGCC 

TTTTTGGCAGGGTCCTACTCCGAAGTAGTCGGGAATCTGCCTTTGTTTATGCCATCTCCT 

CAGCTGGAGTTGTATTTGCCATCACCAGGGCCTGTAGCCAAGGAGAAGTAAAATCCTG 

TTCCTGTGATCCAAAGAAGATGGGAAGCGCCAAGGACAGCAAAGGCATTTTTGATTGG 

GGTGGCTGCAGTGATAACATTGACTATGGGATCAAATTTGCCCGCGCATTTGTGGATGC 

AAAGGAAAGGAAAGGAAAGGATGCCAGAGCCCTGATGAATCTTCACAACAACAGAGC 

TGGCAGGAAGGCTGTAAAGCGGTTCTTGAAACAAGAGTGCAAGTGCCACGGGGTGAG 

CGGCTCATGTACTCTCAGGACATGCTGGCTGGCCATGGCCGACTTCAGGAAAACGGGC 

GATTATCTCTGGAGGAAGTACAATGGGGCCATCCAGGTGGTCATGAACCAGGATGGCA 

CAGGTTTCACTGTGGCTAACGAGAGGTTTAAGAAGCCAACGAAAAATGACCTCGTGTA 

TTTTGAGAATTCTCCAGACTACTGTATCAGGGACCGAGAGGCAGGCTCCCTGGGTACA 

GCAGGCCGTGTGTGCAACCTGACTTCCCGGGGCATGGACAGCTGTGAAGTCATGTGCT 

GTGGGAGAGGCTACGACACCTCCCATGTCACCCGGATGACCAAGTGTGGGTGTAAGTT 

CCACTGGTGCTGCGCCGTGCGCTGTCAGGACTGCCTGGAAGCTCTGGATGTGCACACA 

TGCAAGGCCCCCAAGAACGCTGACTGGACAACCGCTACATGACCCCAGCAGGCGTCAC 

CATCCACCTTCCCTTCTACAAGGACTCCATTGGATCTGCAAGAACACTGGACCTTTGGG 

TTCTTTCTGGGGGGATATTTCCTAAGGCATGTGGCCTTTATCTCAACGGAAGCCCCCTC 

TTCCTCCCTGGGGGCCCCAGGATGGGGGGCCACACGCTGCACCTAAAGCCTACCCTAT 






tctatccatctcctggtgttctgcagtcatctcccctcctggcgagttctctttggaaat 

agcatgacaggctgttcagccgggagggtggtgggcccagaccactgtctccacccac 

cttgacgtttcttctttctagagcagttggccaagcagaaaaaaaagtgtctcaaagg 

agctttctcaatgtcttcccacaaatggtcccaattaagaaattccatacttctctcag 

atggaacagtaaagaaagcagaatcaactgcccctgacttaactttaacttttgaaaa 

gaccaagacttttgtctgtacaagtggttttacagctaccacccttagggtaattggta 

attacctggagaagaatggctttcaatacccttttaagtttaaaatgtgtatttttcaa 

ggcatttattgccatattaaaatctgatgtaacaaggtggggacgtgtgtcctttggta 

ctatggtgtgttgtatctttgtaagagcaaaagcctcagaaagggattgctttgcatta 

ctgtccccttgatataaaaaatctttagggaatgagagttccttctcacttagaatctg 

aagGgaattaaaaagaagatgaatggtctggcaatattctgtaactattgggtgaata 

TGGTGGAAAATAATTTAGTGGATGGAATATCAGAAGTATATCTGTACAGATCAAGAAA 
AAAAGGAAGAATAAAATTCCTATATCAT 

Figure 33 


MNAPLGGIWLWLPLLLTWLTPEVNSSWWYMRATGGSSRVMCDNVPGLVSSQRQLCHRH

PDVMRAISQGVAEWTAECQHQFRQHRWNCNTLDRDHSLFGRVLLRSSRESAFVYAISSAG

VVFAITRACSQGEVKSCSCDPKKMGSAKDSKGIFDWGGCSDNIDYGIKFARAFVDAKERK

GKDARALMNLHNNRAGRKAVKRFLKQECKCHGVSGSCTLRTCWLAMADFRKTGDYLW
RKYNGAIQVVMNQDGTGFTVANERFKKPTKNDLVYFENSPDYCIRDREAGSLGTAGRVC

NLTSRGMDSCEVMCCGRGYDTSHVTRMTKCGCKFHWCCAVRCQDCLEALDVHTCKAPK
NADWTTAT

Figure 34 

CGGGAGTCTTCGGGGAGCTATGCTGAGACCGGGTGGTGCGGAGGAAGCTGCGCAGCTC 

CCGCTTCGGCGCGCCAGCGCCCCGGTCCCTGTGCCGTCGCCCGCGGCCCCCGACGGCTC 

CCGGGCTTCGGCCCGCCTAGGTCTTGCCTGCCTTCTGCTCCTGCTGCTGCTGACGCTGC 

CGGCCCGCGTAGACACGTCCTGGTGGTACATTGGGGCACTGGGGGCACGAGTGATCTG 

TGACAATATCCCTGGTTTGGTGAGCCGGCAGCGGCAGCTGTGCCAGCGTTACCCAGAC 

ATCATGCGTTCAGTGGGCGAGGGTGCCCGAGAATGGATCCGAGAGTGTCAGCACCAAT 

TCCGCCACCACCGCTGGAACTGTACCACCCTGGACCGGGACCACACCGTCTTTGGCCGT 

GTCATGCTCAGAAGTAGCCGAGAGGCAGCTTTTGTATATGCCATCTCATCAGCAGGGG 

TAGTCCACGCTATTACTCGCGCCTGTAGCCAGGGTGAACTGAGTGTGTGCAGCTGTGAC 

CCCTACACCCGTGGCCGACACCATGACCAGCGTGGGGACTTTGACTGGGGTGGCTGCA 

GTGACAACATCCACTACGGTGTCCGTTTTGCCAAGGCCTTCGTGGATGCCAAGGAGAA 

GAGGCTTAAGGATGCCCGGGCCCTCATGAACTTACATAATAACCGCTGTGGTCGCACG 

GCTGTGCGGCGGTTTCTGAAGCTGGAGTGTAAGTGCCATGGCGTGAGTGGTTCCTGTAG 

TCTGCGCACCTGCTGGCGTGCACTCTCAGATTTCCGCCGCACAGGTGATTACCTGCGGC 

GACGCTATGATGGGGCTGTGCAGGTGATGGCCACCCAAGATGGTGCCAACTTCACCGC 

AGCCCGCCAAGGCTATCGCCGTGCCACCCGGACTGATCTTGTCTACTTTGACAACTCTC 

CAGATTACTGTGTCTTGGACAAGGCTGCAGGTTCCCTAGGCACTGCAGGCCGTGTCTGC 

AGCAAGACATCAAAAGGAACAGACGGTTGTGAAATCATGTGCTGTGGCCGAGGGTAC 

GACACAACTCGAGTCACCCGTGTTACCCAGTGTGAGTGCAAATTCCACTGGTGCTGTGC 

TGTACGGTGCAAGGAATGCAGAAATACTGTGGACGTCCATACTTGCAAAGCCCCCAAG 

AAGGCAGAGTGGCTGGACCAGACCTGAACACACAGATACCTCACTCATCCCTCCAATT 

CAAGCCTCTCAACTCAAAAGCACAAGATCCTTGCATGCACACCTTCCTCCACCCTCCAC 

CCTGGGCTGCTACCGCTTCTATTTAAGGATGTAGAGAGTAATCCATAGGGACCATGGTG 

TCCTGGCTGGTTCCTTAGCCCTGGGAAGGAGTTGTCAGGGGATATAAGAAACTGTGCA 

AGCTCCCTGATTTCCCGCTCTGGAGATTTGAAGGGAGAGTAGAAGAGATAGGGGGTCT 

TTAGAGTGAAATGAGTTGCACTAAAGTACGTAGTTGAGGCTCCTTTTTTCTTTCCTTTGC 
ACCAGCTTCCCGACACTTCTTGGTGTGCAAGAGGAAGGGTACCTGTAGAGAGCTTCTTT

TTGTTTCTACCTGGCCAAAGTTAGATGGGACAAAGATGAATGGCATGTCCCTTCTCTGA 

AGTCCGTTTGAGCAGAACTACCTGGTACCCCGAAAGAAAAATCTTAGGCTAGCACATT 

CTATTATTGAGAGCCTGAGATGTTAGCCATAGTGGACAAGGTTCCATTCACATGCTCAT 

ATGTTTATAAACTGTGTTTTGTAGAAGAAAAAGAATCATAACAATACAAACACACATT 

CATTCTCTCTTTTTCTCTCTACCATTCTCAACCTGTATTGGACAGCACTGCCTCTTTTGCT 

TACTTGCTGCCTGTTCAAACTGAGGTGGAATGCAGTGGTTCCCATGCTTAACAGATCAT 

TAAAACACCCTAGAACACTCCTAGGATAGATTAATGT 



Figure 35 

MLDGLGWAISIFGIQLKTEGSLRTAWGIPTQSAFlvnECCLQRYIGALGARVICDNlPGLVSRQ 

RQLCQRYPDIMRSVGEGAREWIRECQHQFRHHRWNCTTLDRDHTVFGRVMLRSSREAAF

VYAISSAGVVHAITRACSQGELSVCSCDPYTRGRHHDQRGTFDWGGCSDNIHYGVRFAKAF

VDAKEKRLKDARALMNLHNNRCGRTAVRRFVKLECKCHGVSGSCTLRTCWRALSDFRRT 

GDYERRRYDGAVQVMATQDGANFTAARQGYRRATRSDLVYFDNSPDYCVLDKAAGSLG 

TAGRVCSKTSKGTDGCEIMCCGRGYDTTRVTRVTQCECKFHWCCAVRCKECRNTVDVHT 

CKAPKKAEWLDQT



Figure 36 

GCGCTTCTGACAAGCCCGAAAGTCATTTCCAATCTCAAGTGGACTTTGTTCCAACTATT 

GGGGGCGTCGCTCCCCCTCYTCATGGTCGCGGGCAAACTTCCTCCTCGGCGCCTCTTCT 

AATGGAGCCGCACCTGCTCGGGCTGCTCCTCGGCCTCCTGCTCGGTGGCACCAGGGTCC 

TCGCTGGCTACCCAATTTGGTGGTCCCTGGCCCTGGGCCAGCAGTACACATCTCTGGGC 

TCACAGCCCCTGCTCTGCGGCTCCATCCCAGGCCTGGTCCCCAAGCAACTGCGCTTCTG 

CCGCAATTACATCGAGATCATGCCCAGCGTGGCCGAGGGCGTGAAGCTGGGCATCCAG 

GAGTGCCAGCACCAGTTCCGGGGCCGCCGCTGGAACTGCACCACCATAGATGACAGCC 

TGGCCATCTTTGGGCCCGTCCTCGACAAAGCCACCCGCGAGTCGGCCTTCGTTCACGCC 

ATCGCCTCGGCCGGCGTGGCCTTCGCCGTCACCCGCTCCTGCGCCGAGGGCACCTCCAC 

CATTTGCGGCTGTGACTCGCATCATAAGGGGCCGCCTGGCGAAGGCTGGAAGTGGGGC 

GGCTGCAGCGAGGACGCTGACTTCGGCGTGTTAGTGTCCAGGGAGTTCGCGGATGCGC 

GCGAGAACAGGCCGGACGCGCGCTCGGCCATGAACAAGCACAACAACGAGGCGGGCC 

GCACGACTATCCTGGACCACATGCACCTCAAATGCAAGTGCCACGGGCTGTCGGGCAG 

CTGTGAGGTGAAGACCTGCTGGTGGGCGCAGCCTGACTTCCGTGCCATCGGTGACTTCC 

TCAAGGACAAGTATGACAGCGCCTCGGAGATGGTAGTAGAGAAGCACCGTGAGTCCCG 

AGGCTGGGTGGAGACCCTCCGGGCCAAGTACTCGCTCTTCAAGCCACCCACGGAGAGG 

GACCTGGTCTACTACGAGAACTCCCCCAACTTTTGTGAGCCCAACCCAGAGACGGGTT 

CCTTTGGCACAAGGGACCGGACTTGCAATGTCACCTCCCACGGCATCGATGGCTGCGA 

TCTGCTCTGCTGTGGCCGGGGCCACAACACGAGGACGGAGAAGCGGAAGGAAAAATG 

CCACTGCATCTTCCACTGGTGCTGCTACGTCAGCTGCCAGGAGTGTATTCGCATCTACG 

ACGTGCACACCTGCAAGTAGGGCACCAG 



Figure 37 
MEPHLLGLLLGLLLGGTRVLAGYPIWWSLALGQQYTSLGSQPLLCGSIPGLVPKQLRFCRN
YIEIMPSVAEGVKLGIQECQHQFRGRRWNCTTIDDSLAIFGPVLDKATRESAFVHAIASAGV
AFAVTRSCAEGTSTICGCDSHHKGPPGEGWKWGGCSEDADFGVLVSREFADARENRPDAR
SAMNKHNNEAGRTTILDHMHLKCKCHGLSGSCEVKTCWWAQPDFRAIGDFLKDKYDSAS

EMVVEKHRESRGWVETLRAKYSLFKPPTERDLVYYENSPNFCEPNPETGSFGTRDRTCNVT
SHGIDGCDLLCCGRGHNTRTEKRKEKCHCIFHWCCYVSCQECIRIYDVHTCK



Figure 38

ATGAGTCCCCGCTCGTGCCTGCGTTCGCTGCGCCTCCTCGTCTTCGCCGTCTTCTCAGCC 

GCCGCGAGCAACTGGCTGTACCTGGCCAAGCTGTCGTCGGTGGGGAGCATCTCAGAGG 

AGGAGACGTGCGAGAAACTCAAGGGCCTGATCCAGAGGCAGGTGCAGATGTGCAAGC 

GGAACCTGGAAGTCATGGACTCGGTGCGCCGCGGTGCCCAGCTGGCCATTGAGGAGTG 

CCAGTACCAGTTCCGGAACCGGCGCTGGAACTGCTCCACACTCGACTCCTTGCCCGTCT 

TCGGCAAGGTGGTGACGCAAGGGATTCGGGAGGCGGCCTTGGTGTACGCCATCTCTTC 

GGCAGGTGTGGCCTTTGCAGTGACGCGGGCGTGCAGCAGTGGGGAGCTGGAGAAGTGC 

GGCTGTGACAGGACAGTGCATGGGGTCAGCCCACAGGGCTTCCAGTGGTCAGGATGCT 

CTGACAACATCGCCTACGGTGTGGCCTTCTCACAGTCGTTTGTGGATGTGCGGGAGAGA 

AGCAAGGGGGCCTCGTCCAGCAGAGCCCTCATGAACCTCCACAACAATGAGGCCGGCA 

GGAAGGCCATCCTGACACACATGCGGGTGGAATGCAAGTGCCACGGGGTGTCAGGCTC 

CTGTGAGGTAAAGACGTGCTGGCGAGCCGTGCCGCCCTTCCGCCAGGTGGGTCACGCA 

CTGAAGGAGAAGTTTGATGGTGCCACTGAGGTGGAGCCACGCCGCGTGGGCTCCTCCA 

GGGCACTGGTGCCACGCAACGCACAGTTCAAGCCGCACACAGATGAGGACTTGGTGTA 

CTTGGAGCCTAGCCCCGACTTCTGTGAGCAGGACATGCGCAGCGGCGTGCTGGGCACG 

AGGGGCCGCACATGCAACAAGACGTCCAAGGCCATCGACGGCTGTGAGCTGCTGTGCT 

GTGGCCGCGGCTTCCACACGGCGCAGGTGGAGCTGGCTGAACGCTGCAGCTGCAAATT 

CCACTGGTGCTGCTTCGTCAAGTGCCGGCAGTGCCAGCGGCTCGTGGAGTTGCACACG 

TGCCGATGA 

Figure 39

MSPRSCLRSLRLLVFAVFSAAASNWLYLAKLSSVGSISEEETCEKLKGLIQRQVQMCKRNL

EVMDSVRRGAQLAIEECQYQFRNRRWNCSTLDSLPVFGKVVTQGIREAALVYAISSAGVA

FAVTRACSSGELEKCGCDRTVHGVSPQGFQWSGCSDNIAYGVAFSQSFVDVRERSKGASSS

RALMNLHNNEAGRKAILTHMRVECKCHGVSGSCEVKTCWRAVPPFRQVGHALKEKFDG

ATEVEPRRVGSSRALVPRNAQFKPHTDEDLVYLEPSPDFCEQDMRSGVLGTRGRTCNKTS 

KAIDGCELLCCGRGFHTAQVELAERCSCKFHWCCFVKCRQCQRLVELHTCR 



Figure 40 

ATTAATTCTGGCTCCACTTGTTGCTCGGCCCAGGTTGGGGAGAGGACGGAGGGTGGCC 

GCAGCGGGTTCCTGAGTGAATTACCCAGGAGGGACTGAGCACAGCACCAACTAGAGA 

GGGGTCAGGGGGTGCGGGACTCGAGCGAGCAGGAAGGAGGCAGCGCCTGGCACCAGG 

GCTTTGACTCAACAGAATTGAGACACGTTTGTAATCGCTGGCGTGCCCCGCGCACAGG 

ATCCCAGCGAAAATCAGATTTCCTGGTGAGGTTGCGTGGGTGGATTAATTTGGAAAAA 

GAAACTGCCTATATCTTGCCATCAAAAAACTCACGGAGGAGAAGCGCAGTCAATCAAC 

AGTAAACTTAAGAGACCCCCGATGCTCCCCTGGTTTAACTTGTATGCTTGAAAATTATC 

TGAGAGGGAATAAACATCTTTTCCTTCTTCCCTCTCCAGAAGTCCATTGGAATATTAAG 

CCCAGGAGTTGCTTTGGGGATGGCTGGAAGTGCAATGTCTTCCAAGTTCTTCCTAGTGG 
CTTTGGCCATATTTTTCTCCTTCGCCCAGGTTGTAATTGAAGCCAATTCTTGGTGGTCGC

TAGGTATGAATAACCCTGTTCAGATGTCAGAAGTATATATTATAGGAGCACAGCCTCTC 

TGCAGCCAACTGGCAGGACTTTCTCAAGGACAGAAGAAACTGTGCCACTTGTATCAGG 

ACCACATGCAGTACATCGGAGAAGGCGCGAAGACAGGCATCAAAGAATGCCAGTATC 

AATTCCGACATCGACGGTGGAACTGCAGCACTGTGGATAACACCTCTGTTTTTGGCAGG 

GTGATGCAGATAGGCAGCCGCGAGACGGCCTTCACATACGCCGTGAGCGCAGCAGGG 

GTGGTGAACGCCATGAGCCGGGCGTGCCGCGAGGGCGAGCTGTCCACCTGCGGCTGCA 

GCCGCGCCGCGCGCCCCAAGGACCTGCCGCGGGACTGGCTCTGGGGCGGCTGCGGCGA 

CAACATCGACTATGGCTACCGCTTTGCCAAGGAGTTCGTGGACGCCCGCGAGCGGGAG 

CGCATCCACGCCAAGGGCTCCTACGAGAGTGCTCGCATCCTCATGAACCTGCACAACA 

ACGAGGCCGGCCGCAGGACGGTGTACAACCTGGCTGATGTGGCCTGCAAGTGCCATGG 

GGTGTCCGGCTCATGTAGCCTGAAGACATGCTGGCTGCAGCTGGCAGACTTCCGCAAG 

GTGGGTGATGCCCTGAAGGAGAAGTACGACAGCGCGGCGGCCATGCGGCTCAACAGC 

CGGGGCAAGTTGGTACAGGTCAACAGCCGCTTCAACTCGCCCACCACACAAGACCTGG 

TCTACATCGACCCCAGCCCTGACTACTGCGTGCGCAATGAGAGCACCGGCTCGCTGGG 

CACGCAGGGCCGCCTGTGCAACAAGACGTCGGAGGGCATGGATGGCTGCGAGCTCATG 

TGCTGCGGCCGTGGGTACGACCAGTTCAAGACCGTGCAGACGGAGCGCTGCCACTGCA 

AGTTCCACTGGTGCTGCTACGTCAAGTGCAAGAAGTGCACGGAGATCGTGGACCAGTT 

TGTGTGCAAGTAGTGGGTGCCACCCAGCACTCAGCCCCGCTCCCAGGACCCGCTTATTT 

ATAGAAAGTACAGTGATTCTGGTTTTTGGTTTTTAGAAATATTTTTTATTTTTCCCCAAG 

AATTGCAACCGGAACCATTTTTTTTCCTGTTACCATCTAAGAACTCTGTGGTTTATTATT 

AATATTATAATTATTATTTGGCAATAATGGGGGTGGGAACCACGAAAAATATTTATTTT 

GTGGATCTTTGAAAAGGTAATACAAGACTTCTTTTGGATAGTATAGAATGAAGGGGGA 

AATAACACATACCCTAACTTAGCTGTGTGGGACATGGTACACATCCAGAAGGTAAAGA 

AATACATTTTCTTTTTCTCAAATATGCCATCATATGGGATGGGTAGGTTCCAGTTGAAA 

GAGGGTGGTAGAAATCTATTCACAATTCAGCTTCTATGACCAAAATGAGTTGTAAATTC 

TCTGGTGCAAGATAAAAGGTCTTGGGAAAACAAAACAAAACAAAACAAACCTCCCTTC 

CCCAGCAGGGCTGCTAGCTTGCTTTCTGCATTTTCAAAATGATAATTTACAATGGAAGG 

ACAAGAATGTCATATTCTCAAGGAAAAAAGGTATATCACATGTCTCATTCTCCTCAAAT 

ATTCCATTTGCAGACAGACCGTCATATTCTAATAGCTCATGAAATTTGGGCAGCAGGGA 

GGAAAGTCCCCAGAAATTAAAAAATTTAAAACTCTTATGTCAAGATGTTGATTTGAAG 

CTGTTATAAGAATTGGGATTCCAGATTTGTAAAAAGACCCCCAATGATTCTGGACACTA 

GATTTTTTGTTTGGGGAGGTTGGCTTGAACATAAATGAAATATCCTGTATTTTCTTAGG 

GATACTTGGTTAGTAAATTATAATAGTAGAAATAATACATGAATCCCATTCACAGGTTT 

CTCAGCCCAAGCAACAAGGTAATTGCGTGCCATTCAGCACTGCACCAGAGCAGACAAC 

CTATTTGAGGAAAAACAGTGAAATCCACCTTCCTCTTCACACTGAGCCCTCTCTGATTC 

CTCCGTGTTGTGATGTGATGCTGGCCACGTTTCCAAACGGCAGCTCCACTGGGTCCCCT 

TTGGTTGTAGGACAGGAAATGAAACATTAGGAGCTCTGCTTGGAAAACAGTTCACTAC 

TTAGGGATTTTTGTTTCCTAAAACTTTTATTTTGAGGAGCAGTAGTTTTCTATGTTTTAA 

TGACAGAACTTGGCTAATGGAATTCACAGAGGTGTTGCAGCGTATCACTGTTATGATCC 

TGTGTTTAGATTATCCACTCATGCTTCTCCTATTGTACTGCAGGTGTACCTTAAAACTGT 

TCCCAGTGTACTTGAACAGTTGCATTTATAAGGGGGGAAATGTGGTTTAATGGTGCCTG 

ATATCTCAAAGTCTTTTGTACATAACATATATATATATATACATATATATAAATATAAA 

TATAAATATATCTCATTGCAGCCAGTGATTTAGATTTACAGCTTACTCTGGGGTTATCTC 

TCTGTCTAGAGCATTGTTGTCCTTCACTGCAGTCCAGTTGGGATTATTCCAAAAGTTTTT 

TGAGTCTTGAGCTTGGGCTGTGGCCCCGCTGTGATCATACCCTGAGCACGACGAAGCA 

ACCTCGTTTCTGAGGAAGAAGCTTGAGTTCTGACTCACTGAAATGCGTGTTGGGTTGAA 

GATATCTTTTTTTCTTTTCTGCCTCACCCCTTTGTCTCCAACCTCCATTTCTGTTCACTTT 

GTGGAGAGGGCATTACTTGTTCGTTATAGACATGGACGTTAAGAGATATTCAAAACTC 

AGAAGCATCAGCAATGTTTCTCTTTTCTTAGTTCATTCTGCAGAATGGAAACCCATGCC 

TATTAGAAATGACAGTACTTATTAATTGAGTCCCTAAGGAATATTCAGCCCACTACATA 

GATAGCTTTTTTTTTTTTTTTTTTTTTTTAATAAGGACACCTCTTTCCAAACAGGCCATCA 
AATATGTTCTTATCTCAGACTTACGTTGTTTTAAAAGTTTGGAAAGATACACATCTTTTC

ATACCCCCCCTTAGGAGGTTGGGCTTTCATATCACCTCAGCCAACTGTGGCTCTTAATT 

TATTGCATAATGATATCCACATCAGCCAACTGTGGCTCTTTAATTTATTGCATAATGAT 

ATTCACATCCCCTCAGTTGCAGTGAATTGTGAGCAAAAGATCTTGAAAGCAAAAAGCA 

CTAATTAGTTTAAAATGTCACTTTTTTGGTTTTTATTATACAAAAACCATGAAGTACTTT 

TTTTATTTGCTAAATCAGATTGTTCCTTTTTAGTGACTCATGTTTATGAAGAGAGTTGAG 

TTTAACAATCCTAGCTTTTAAAAGAAACTATTTAATGTAAAATATTCTACATGTCATTC 

AGATATTATGTATATCTTCTAGCCTTTATTCTGTACTTTTAATGTACATATTTCTGTCTTG 

CGTGATTTGTATATTTCACTGGTTTAAAAAACAAACATCGAAAGGCTTATTCCAAATGG 

AAG 



Figure 41 

MAGSAMSSKFFLVALAIFFSFAQVVIEANSWWSLGMNNPVQMSEVYIIGAQPLCSQLAGLS
QGQKKLCHLYQDHMQYIGEGAKTGIKECQYQFRHRRWNCSTVDNTSVFGRVMQIGSRET

AFTYAVSAAGVVNAMSRACREGELSTCGCSRAARPKDLPRDWLWGGCGDNIDYGYRFA

KEFVDARERERIHAKGSYESARILMNLHNNEAGRRTVYNLADVACKCHGVSGSCSLKTC

WLQLADFRKVGDALKEKYDSAAAMRLNSRGKLVQVNSRFNSPTTQDLVYIDPSPDYCVR

NESTGSLGTQGRLCNKTSEGMDGCELMCCGRGYDQFKTVQTERCHCKFHWCCYVKCKK

CTEIVDQFVCK 



Figure 42 

GGCACGAGCGCAGGAGACACAGGCGCTGGCTGCCCCGTCCGCTCTCCGCCTCCGCCGC 

GCCCTCCTCGCCCGGGATGGGCCCCCCCGCCGCCGCCGGATCCCTCGCCTCCCGGCCGC 

CGCCGTTGCGCTCGCCGCGCTCGCACTGAAGCCCGGGCCCTCGCGCGCCGCGGTTCGC 

CCCGCAGCCTCGCCCCCTGCCCACCCGGGCGGCCGTAGGGCGGTCACGATGCTGCCGC 

CCTTACCCTCCCGCCTCGGGCTGCTGCTGCTGCTGCTCCTGTGCCCGGCGCACGTCGGC 

GGACTGTGGTGGGCTGTGGGCAGCCCCTTGGTTATGGACCCTACCAGCATCTGCAGGA 

AGGCACGGCGGCTGGCCGGGCGGCAGGCCGAGTTGTGCCAGGCTGAGCCGGAAGTGG 

TGGCAGAGCTAGCTCGGGGCGCCCGGCTCGGGGTGCGAGAGTGCCAGTTCCAGTTCCG 

CTTCCGCCGCTGGAATTGCTCCAGCCACAGCAAGGCCTTTGGACGCATCCTGCAACAG 

GACATTCGGGAGACGGCCTTCGTGTTCGCCATCACTGCGGCCGGCGCCAGCCACGCCG 

TCACGCAGGCCTGTTCTATGGGCGAGCTGCTGCAGTGCGGCTGCCAGGCGCCCCGCGG 

GCGGGCCCCTCCCCGGCCCTCCGGCCTGCCCGGCACCCCCGGACCCCCTGGCCCCGCG 

GGCTCCCCGGAAGGCAGCGCCGCCTGGGAGTGGGGAGGCTGCGGCGACGACGTGGAC 

TTCGGGGACGAGAAGTCGAGGCTCTTTATGGACGCGCGGCACAAGCGGGGACGCGGA 

GACATCCGCGCGTTGGTGCAACTGCACAACAACGAGGCGGGCAGGCTGGCCGTGCGG 

AGCCACACGCGCACCGAGTGCAAATGCCACGGGCTGTCGGGATCATGCGCGCTGCGCA 

CCTGCTGGCAGAAGCTGCCTCCATTTCGCGAGGTGGGCGCGCGGCTGCTGGAGCGCTT 

CCACGGCGCCTCACGCGTCATGGGCACCAACGACGGCAAGGCCCTGCTGCCCGCCGTC 

CGCACGCTCAAGCCGCCGGGCCGAGCGGACCTCCTCTACGCCGCCGATTCGCCCGACT 

TTTGCGCCCCCAACCGACGGACCGGCTCCCCCGGCACGCGCGGTCGCGCCTGCAATAG 

CAGCGCCCCGGACCTCAGCGGCTGCGACCTGCTGTGCTGCGGCCGCGGGCACCGCCAG 
GAGAGCGTGCAGCTCGAAGAGAACTGCCTGTGCCGCTTCCACTGGTGCTGCGTAGTAC

AGTGCCACCGTTGCCGTGTGCGCAAGGAGCTCAGCCTCTGCCTGTGACCCGCCGCC 

CGGCCGCTAGACTGACTTCGCGCAGCGGTGGCTCGCACCTGTGGGACCTCAGGGCACC 

GGCACCGGGCGCCTCTCGCCGCTCGAGCCCAGCCTCTCCCTGCCAAAGCCCAACTCCC 

AGGGCTCTGGAAATGGTGAGGCGAGGGGCTTGAGAGGAACGCCCACCCACGAAGGCC 

CAGGGCGCCAGACGGCCCCGAAAAGGCGCTCGGGGAGCGTTTAAAGGACACTGTACA 

GGCCCTCCCTCCCCTTGGCCTCTAGGAGGAAACAGTTTTTTAGACTGGAAAAAAGCCA 

GTCTAAAGGCCTCTGGATACTGGGCTCCCCAGAACTGCTGGCCACAGGATGGTGGGTG 

AGGTTAGTATCAATAAAGATATTTAAACCAAAAAAAAAAAAAAAAAAAAA 

Figure 43 

MLPPLPSRLGLLLLLLLCPAHVGGLWWAVGSPLVMDPTSICRKARRLAGRQAELCQAEPE

VVAELARGARLGVRECQFQFRFRRWNCSSHSKAFGRILQQDIRETAFVFAITAAGASHAVT

QACSMGELLQCGCQAPRGRAPPRPSGLPGTPGPPGPAGSPEGSAAWEWGGCGDDVDFGD

EKSRLFMDARHKRGRGDIRALVQLHNNEAGRLAVRSHTRTECKCHGLSGSCALRTCWQK

LPPFREVGARLLERFHGASRVMGTNDGKALLPAVRTLKPPGRADLLYAADSPDFCAPNRR

TGSPGTRGRACNSSAPDLSGCDLLCCGRGHRQESVQLEENCLCRFHWCCVVQCHRCRVRK 

ELSLCL 



Figure 44 

CACGCGTCCGGGCCAATCGGGACTATGAACCGGAAAGCGCTGCGCTGCCTGGGCCACC 

TCTTTCTCAGCCTGGGCATGGTCTGCCTCCGGATCGGTGGCTTCTCCTCAGTGGTAGCTC 

TGGGCGCAACGATCATCTGTAACAAGATCCCAGGCCTGGCTCCCAGACAGCGGGCGAT 

CTGCCAGAGCCGGCCCGACGCCATCATCGTCATAGGAGAAGGCTCACAAATGGGCCTG 

GACGAGTGTCAGTTTCAGTTCCGCAATGGCCGCTGGAACTGCTCTGCACTGGGAGAGC 

GCACCGTCTTCGGGAAGGAGCTCAAAGTGGGGAGCCGGGACGGTGCGTTCACCTACGC 

CATCATTGCCGCCGGCGTGGCCCACGCCATCACAGCTGCCTGTACCCATGGCAACCTG 

AGCGACTGTGGCTGCGACAAAGAGAAGCAAGGCCAGTACCACCGGGACGAGGGCTGG 

AAGTGGGGTGGCTGCTCTGCCGACATCCGCTACGGCATCGGCTTCGCCAAGGTCTTTGT 

GGATGCCCGGGAGATCAAGCAGAATGCCCGGACTCTCATGAACTTGCACAACAACGAG 

GCAGGCCGAAAGATCCTGGAGGAGAACATGAAGCTGGAATGTAAGTGCCACGGCGTG 

TCAGGCTCGTGCACCACCAAGACGTGCTGGACCACACTGCCACAGTTTCGGGAGCTGG 

GCTACGTGCTCAAGGACAAGTACAACGAGGCCGTTCACGTGGAGCCTGTGCGTGCCAG 

CCGCAACAAGCGGCCCACCTTCCTGAAGATCAAGAAGCCACTGTCGTACCGCAAGCCC 

ATGGACACGGACCTGGTGTACATCGAGAAGTCGCCCAACTACTGCGAGGAGGACCCGG 

TGACCGGCAGTGTGGGCACCCAGGGCCGCGCCTGCAACAAGACGGCTCCCCAGGCCAG 

CGGCTGTGACCTCATGTGCTGTGGGCGTGGCTACAACACCCACCAGTACGCCCGCGTG 

TGGCAGTGCAACTGTAAGTTCCACTGGTGCTGCTATGTCAAGTGCAACACGTGCAGCG 

AGCGCACGGAGATGTACACGTGCAAGTGAGCCCCGTGTGCACACCACCCTCCCGCTGC 

AAGTCAGATTGCTGGGAGGACTGGACCGTTTCCAAGCTGCGGGCTCCCTGGCAGGATG 

CTGAGCTTGTCTTTTCTGCTGAGGAAGGTACTTTTCCTGGGTTTCCTGCAGGCATCCGTG 

GGGGAAAAAAAATCTCTCAGAACCCTCAACTATTCTGTTCCACACCCAATGCTGCTCCA 

CCCTCCCCCAGACACAGCCCAAGTCCCTCCGCGGCTGGAGCGAAGCCTTCTGCAGCAG 

GAACTCTGGACCCCTGGGCCTCATCACAGCAATATTTAACAATTTATTCTGATAAAAAT 

AATATTAATTTATTTAATTAAAAAGAATTCTTCCACCTCAAAAAAAAAAAAAAAAAAA 

AAAAAAAAGGGGGG 



Figure 45 
MNRKARRCLGHLFLSLGMVYLRIGGFSSVVA
EGSQMGLDECQFQFRNTGRWNCSALGERTWGKELK^ 

TQGNLSDCGCDKEKQGQYHRDEGWKWGGCSADIRYGIGFAKVFVDAREIKQNARTLMNL
HNNEAGRKILEENMKLECKCHGVSGSCTTKTCWTTLPQFRELGYVLKDKYNEAVHVEPV
RASRNKRPTFLKIKKPLSYRKPMDTDLVYIEKSPNYCE
GCDLMCCGRGYNTHQYARVWQCNCKFHWCCYVKCNTCSERTEMYTCK 



Figure 46 



MHRNFRECWEFYVFLCFGVLYVKLGALSSWALGANIICNKIPGLAPRQRAICQSRPDAIIVIG

EGAQMGll^CQYQFRFGRWNCSALGEKTVFGQELRVGSREAAFTYAITAAGVAHAVTAA 

CSQGNLSNCGCDREKQGYYNQAEGWKWGGCSADVRYGIDFSRRFVDAREIECKNARR^ 

NLHNNEAGRKVLEDRMQLECKCHGVSGSCTTKTCWTTLPKFREVGHLLKEKYNAAVQVE 

VYRASRLRQPTFLRIKQLRSYQKPMETDLVYIEKSPNYCEEDAATGSVGTQGRLCNRTSPG 

ADGCDTMCCGRGYNTHQYTKVWQCNCKFHWCCFVKCNTCSERTEVFTCK 



Figure 47 

TCCGCTTACACACCAAGGAAAGTTGGGCTTTGAAGAATTCCATCCCCATGGCCACTGG 

AGGAAGAATATTTCNCCCGTCTTGCTTACCCATCTCCCCAGTTTTTTGGAATTTTCTCTA 

GCTGTTACTCCAGAGGATTATGTTTCTTTCAAAGCCTTCTGTGTACATCTGTCTTTTCAC 

CTGTGTCCTCCAACTCAGCCACAGCTGGTCGGTGAACAATTTCCTGATGACTGGTCCAA 

AGGCTTACCTGATTTACTCCAGCAGTGTGGCAGCTGGTGCCCAGAGTGGTATTGAAGA 

ATGCAAGTATCAGTTTGCCTGGGACCGCTGGAACTGCCCTGAGAGAGCCCTGCAGCTG 

TCCAGCCATGGTGGGCTTCGCAGTGCCAATCGGGAGACAGCATTTGTGCATGCCATCA 

GTTCTGCTGGAGTCATGTACACCCTGACTAGAAACTGCAGCCTTGGAGATTTTGATAAC 

TGTGGCTGTGATGACTCCCGCAACGGGCAACTGGGGGGACAAGGCTGGCTGTGGGGAG 

GCTGCAGTGACAATGTGGGCTTCGGAGAGGCGATTTCCAAGCAGTTTGTCGATGCCCT 

GGAAACAGGACAGGATGCACGGGCAGCCATGAACCTGCACAACAACGAGGCTGGCCG 

CAAGGCGGTGAAGGGCACCATGAAACGCACGTGTAAGTGCCATGGCGTGTCTGGCAGC 

TGCACCACGCAGACCTGTTGGCTGCAGCTGCCCGAGTTCCGCGAGGTGGGCGCGCACC 

TGAAGGAGAAGTACCACGCAGCACTCAAGGTGGACCTGCTGCAGGGTGCTGGCAACA 

GCGCGGCCGCCCGCGGCGCCATCGCCGACACCTTTCGCTCCATCTCTACCCGGGAGCTG 

GTGCACCTGGAGGAGTCCCCGGACTACTGCCTGGAGAACAAAACGCTAGGGCTGCTGG 

GCACCGAAGGCCGAGAGTGCCTAAGGCGCGGGCGGGCCCTGGGTCGCTGGGAACTCC 

GCAGCTGCCGCCGGCTCTGCGGGGACTGCGGGCTGGCGGTGGAGGAGCGCCGGGCCG 

AGACCGTGTCCAGCTGCAACTGCAAGTTCCACTGGTGCTGTGCAGTCCGCTGCGAGCA 

GTGCCGCCGGAGGGTCACCAAGTACTTCTGTAGCCGCGCAGAGCGGCCGCGGGGGGGC 

GCTGCGCACAAACCCGGGAGAAAACCCTAAGGGTTTCCTCTGCCCCCTCCTTTTCCCAC 

TGGTTCTTGGCTTCCTTTAGAGACCCCGGTAATTGTGGAACCTAGGGAATGGGGAACCC 

GCTCTCCCAGACCTAGGGATCCTGAAAGGGAAAAACTGCAATTTCTCCAAAGCTTGCC 

ACTTTCCAGCCTGTTTCCCCAATTCCTCTGTGCTCTCCTAAAGCTCTGTCTGAATCCTCG 

CAGCCACACCTAGGTCTGAAAACTCAGGCTTTGAGTTACTGATCTTCCTTGGATTAGGA 

AAACAGGTGTTCCTCCTCCCCTCTCCTATCAGCCCTAATCTCTGACCTAGCCTATCAAC 

CCTTAGGCGCTGGAAAAACCTTCTCATACACGCAGGACCCAGGTTAACTCAAAGCTTT 
GCCCTTTTGCCCACTGTCTGCTACCAGGGGCTCACCCTCTGCTGCACCTCTCTTCTGCAC

AGCTCCTCCCCTGCTACTGCTGACCAAATTCCCAGGAATCTTGAATGCTTTCTCTCCTCT 

TCTCCCTTTCCTTTCCCAAAAAAAACTGAGGAAACTGGCCCCGGAAAAGCATGTCTTTG 

GGGTTGGTTCCTAGAGGCAGAGGTTGAAGATGGAAGAGGGAGCTCTGGAGTGCTAACT 

TGAACACCAAGGGTGCTACTCATCCCTATGGTATCATATCATGAATGGACTTTACTAGT 

GGGGCAATGACTTTCCTAGACAATAACCCGAGGGACTCCAGATACATACCCCGAAGGT

CTAGGAAATACGTTAAGGGCAGATTACAGTCATTTCCTACCCTTTAAAGGTAACTTCTC 

CCTTCTCCTGACCTACTTCCTCCTAGCAACCAACTTTACCTCTTCTTCTCCAAAGGATCT 

TTGTTCCTCTGAGCCAAGACTGAGGTAAATAAAGCCACTTTCCTCTTCAGATCCTGGTC 

TGCACCTCTAGA 



Figure 48 

MFLSKPSVYICLFTCVLQLSHSWSVNNFLMTGPKAYLIYSSSVAAGAQSGIEECKYQFAWD

RWNCPERALQLSSHGGLRSANRETAFVHAISSAGVMYTLTRNCSLGDFDNCGCDDSRNGQ 

LGGQGWLWGGCSDNVGFGEAISKQFVDALETGQDARAAMNLHNNEAGRKAVKGTMKR

TCKCHGVSGSCTTQTCWLQLPEFREVGAHLKEKYHAALKVDLLQGAGNSAAARGAIADT

FRSISTRELVHLEDSPDYCLENKTLGLLGTEGRECLRRGRALGRWELRSCRRLCGDCGLAV 

EERRAETVSSCNCKFHWCCAVRCEQCRRRVTKYFCSRAERPRGGAAHKPGRKP



Figure 49 

GCGGCCGCGTCGACGGAGGGGCTGCAGCTCCGTCAGCCCGGCAGAGCCACCCTGAGCT 

CGGTGAGAGCAAAGCCAGAGCCCCCAGTCCTTTGCTCGCCGGCTTGCTATCTCTCTCGA 

TCACTCCCTCCCTTCCTCCCTCCCTTCCTCCCGGCGGCCGCGGCGGCGCTGGGGAAGCG 

GTGAAGAGGAGTGGCCCGGCCCTGGAAGAATGCGGCTCTGACAAGGGGACAGAACCC 

AGCGCAGTCTCCCCACGGTTTAAGCAGCACTAGTGAAGCCCAGGCAACCCAACCGTGC 

CTGTCTCGGACCCCGCACCCAAACCACTGGAGGTCCTGATCGATCTGCCCACCGGAGC 

CTCCGGGCTTCGACATGCTGGAGGAGCCCCGGCCGCGGCCTCCGCCCTCGGGCCTCGC 

GGGTCTCCTGTTCCTGGCGTTGTGCAGTCGGGCTCTAAGCAATGAGATTCTGGGCCTGA 

AGTTGCCTGGCGAGCCGCCGCTGACGGCCAACACCGTGTGCTTGACGCTGTCCGGCCT 

GAGCAAGCGGCAGCTAGACCTGTGCCTGCGCAACCCCGACGTGACGGCGTCCGCGCTT 

CAGGGTCTGCACATCGCGGTCCACGAGTGTCAGCACCAGCTGCGCGACCAGCGCTGGA 

ACTGCTCCGCGCTTGAGGGCGGCGGCCGCCTGCCGCACCACAGCGCCATCCTCAAGCG 

CGGTTTCCGAGAAAGTGCTTTTTCCTTCTCCATGCTGGCTGCTGGGGTCATGCACGCAG 

TAGCCACGGCCTGCAGCCTGGGCAAGCTGGTGAGCTGTGGCTGTGGCTGGAAGGGCAG 

TGGTGAGCAGGATCGGCTGAGGGCCAAACTGCTGCAGCTGCAGGCACTGTCCCGAGGC 

AAGAGTTTCCCCCACTCTCTGCCCAGCCCTGGCCCTGGCTCAAGCCCCAGCCCTGGCCC 

CCAGGACACATGGGAATGGGGTGGCTGTAACCATGACATGGACTTTGGAGAGAAGTTC 

TCTCGGGATTTCTTGGATTCCAGGGAAGCTCCCCGGGACATCCAGGCACGAATGCGAA 

TCCACAACAACAGGGTGGGGCGCCAGGTGGTAACTGAAAACCTGAAGCGGAAATGCA 

AGTGTCATGGCACATCAGGCAGCTGCCAGTTCAAGACATGCTGGAGGGCGGCCCCAGA 

GTTCCGGGCAGTGGGGGCGGCGTTGAGGGAGCGGCTGGGCCGGGCCATCTTCATTGAT 

ACCCACAACCGCAATTCTGGAGCCTTCCAGCCCCGTCTGCGTCCCCGTCGCCTCTCAGG 

AGAGCTGGTCTACTTTGAGAAGTCTCCTGACTTCTGTGAGCGAGACCCCACTATGGGCT 

CCCCAGGGACAAGGGGCCGGGCCTGCAACAAGACCAGCCGCCTGTTGGATGGCTGTGG 

CAGCCTGTGCTGTGGCCGTGGGCACAACGTGCTCCGGCAGACACGAGTTGAGCGCTGC 

CATTGCCGCTTCCACTGGTGCTGCTATGTGCTGTGTGATGAGTGCAAGGTTACAGAGTG 

GGTGAATGTGTGTAAGTGAGGGTCAGCCTTACCTTGGGGCTGGGGAAGAGGACTGTGT 

GAGAGGGGCGCCTTTTCAGCCCTTTGCTCTGATTTCCTTCCAAGGTCACTCTTGGTCCCT 
GGAAGCTTAAAGTATCTACCTGGAAACAGCTTTAGGGGTGGTGGGGGTCAGGTGGACT

CTGGGATGTGTAGCCTTCTCCCCAACAATTGGAGGGTCTTGAGGGGAAGCTGCCACCC 

CTCTTCTGCTCCTTAGACACCTGAATGGACTAAGATGAAATGCACTGTATTGCTCCTCC 

CACTTCTCAACTCCAGAGCCCCTTTAACCCTGATTCATACTCCTTTTGGCTGGGGAGTC 

CCTATAGTTTCACCACTCCTCTCCCTTGAGGGATAACCCCAGGCACTGTTTGGAGCCAT 

AAGATCTGTATCTAGAAAGAGATCACCCACTCCTATGTACTATCCCCAAACTCCTTTAC 

TGCAGCCTGGGCTCCCTCTTGTGGGATAATGGGAGACAGTGGTAGAGAGGTTTTTCTTG 

GGAAAGAGACAGAGTGCTGAGGGGCACTCTCCCCTGAATCCTCAGAGAGTTGTCTGTC 

CAGGCCCTTAGGGAAGTTGTCTCCTTCCATTCAGATGTTAATGGGGACCCTCCAAAGGA 

AGGGGTTTTCCCATGACTCTTGGAGCCTCTTTTTCCTTCTTCAGCAGGAAGGGTGGGAA 

GGGATAATTTATCATACTGAGACTTGTTCTTGGTTCCTGTTTGAAACTAAAATAAATTA 

AGTTACTGGAAAAAAAAAAAAAAAAAA 

Figure 50 



MLEEPRPRPPPSGLAGLLFLALCSRALSNEILGLKLPGEPPLTANTVCLTLSGLSKRQLDLCL 
RNPDVTASALQGLHIAVHECQHQLRDQRWNCSALEGGGRLPHHSAILKRGFRESAFSFSM
LAAGVMHAVATACSLGKLVSCGCGWKGSGEQDRLRAKLLQLQALSRGKSFPHSLPSPGP 
GSSPSPGPQDTWEWGGCNHDMDFGEKFSRDFLDSREAPRDIQARMRIHNNRVGRQVVTEN

LKRKCKCHGTSGSCQFKTCWRAAPEFRAVGAALRERLGRAIFIDTHNRNSGAFQPRLRPRR
LSGELVYFEKSPDFCERDPTMGSPGTRGRACNKTSRLLDGCGSLCCGRGHNVLRQTRVER 

CHCRFHWCCYVLCDECKVTEWVNVCK 



Figure 51 

TAACCCGCCGCCTCCGCTCTCCCCGGCTGCAGGCGGCGTGCAGGACCAGCGGCGGCCG 

TGCAGGCGGAGGACTTCGGCGCGGCTCCTCCTGGGTGTGACCCCGGGCGCGCCCGCCG 

CGCGACGATGAGGGCGCGGCCGCAGGTCTGCGAGGCGCTGCTCTTCGCCCTGGCGCTC 

CAGACCGGCGTGTGCTATGGCATCAAGTGGCTGGCGCTGTCCAAGACACCATCGGCCC 

TGGCACTGAACCAGACGCAACACTGCAAGCAGCTGGAGGGTCTGGTGTCTGCACAGGT 

GCAGCTGTGCCGCAGCAACCTGGAGCTCATGCACACGGTGGTGCACGCCGCCCGCGAG 

GTCATGAAGGCCTGTCGCCGGGCCTTTGCCGACATGCGCTGGAACTGCTCCTCCATTGA 

GCTCGCCCCCAACTATTTGCTTGACCTGGAGAGAGGGACCCGGGAGTCGGCCTTCGTG 

TATGCGCTGTCGGCCGCCACCATCAGCCACGCCATCGCCCGGGCCTGCACCTCCGGCG 

ACCTGCCCGGCTGCTCCTGCGGCCCCGTCCCAGGTGAGCCACCCGGGCCCGGGAACCG 

CTGGGGAAGATGTGCGGACAACCTCAGCTACGGGCTCCTCATGGGGGCCAAGTTTTCC 

GATGCTCCTATGAAGGTGAAAAAAACAGGATCCCAAGCCAATAAACTGATGCGTCTAC 

ACAACAGTGAAGTGGGGAGACAGGCTCTGCGCGCCTCTCTGGAAATGAAGTGTAAGTG 

CCATGGGGTGTCTGGCTCCTGCTCCATCCGCACCTGCTGGAAGGGGCTGCAGGAGCTG 

CAGGATGTGGCTGCTGACCTCAAGACCCGATACCTGTCGGCCACCAAGGTAGTGCACC 

GACCCATGGGCACCCGCAAGCACCTGGTGCCCAAGGACCTGGATATCCGGCCTGTGAA 

GGACTGGGAACTTGTTTATTTGCAGAGCTCACCTGACTTTTGCATGAAGAATGAGAAG 

GTGGGCTCCCACGGGACACAAGACAGGCAGTGCAACAAGACTTCCAACGGAAGCGAC

AGCTGCGACCTTATGTGCTGCGGGCGTGGCTACAACCCCTACACAGACCGCGTGGTCG 

AGCGGTGCCACTGTAAGTACCACTGGTGTTGCTACGTCACCTGCCGCAGGTGTGAGCGT 

ACCGTGGAGCGCTATGTCTGCAAGTGAGGCCCTGCCCTCCGCCCCACGCAGGAGCGAG 

GACTTTGCTCAAGGACCCTCAGCAACTGGGGCCGGGGGCCTGGAGACACTCCATGGAG 

CTCTGCTTGTGAATTCCAGATGCCAGGCATGGGAGGCGGCTTGTGCTTTGCCTTCACTT 

GGAAGCCACCAGGAACAGAAGGTCTGGCCACCCTGGAAGGAGNGCAGGACATCAAAG 

GAAACCGACAAGATTAAAAATAACTTGGCAGCCTGAGNTCTGGAGTGCCCACAGNNTG 
GTGTAAGGAGCGGGGCTTGGGATCGGTGAGACTGATACAGACTTGACCTTTCAGGGCC

ACAGAGACCAGCCTCCGGGAAGGGGTCTGCCCGCCTTCTTCAGAATGTTCTGCGGGAC 

CCCCTGGCCCACCCTGGGGTCTGAGCCTGCTGGGCCACCACATGGAATCACTAGCTTCG 

GGTTGTAAATGTTTTCTTTTGTTTNTTGCTTTTTCTTCCTTTGGGATGTTGGAAGCTACA 

GAAATATTTATAAAACATAGCTTTTTCTTTGGGGTGGCACTTCTCAATTCCTCTTTATAT 

ATTTTANATATATAAATATATATGTATATATATAATGATCTCTAATNTAAAACTAGCTT 

TTTAAGCAGCTGTATGAAATAAATGCTGAGTGAGCCCCAGCCCGCCCCTGCAGTTCCC 

GGCCTCGTCAAGTGAACTCGGCAGACCCTGGGGCTGGCAGAGGGAGCTCTCCAGTTTC 

CGGGCA 



Figure 52 

MRARPQVCEALLFALALQTGVCYGIKWLALSKTPSALALNQTQHCKQLEGLVSAQVQLCR 

SNLELMHTVVHAAREVMKACRRAFADMRWNCSSIELAPNYLLDLERGTRESAFVYALSA

ATISHAIARACTSGDLPGCSCGPVPGEPPGPGNRWGRCADNLSYGLLMGAKFSDAPMKVK

KTGSQANKLMRLHNSEVGRQALRASLEMKCKCHGVSGSCSIRTCWKGLQELQDVAADLK

TRYLSATKVVHRPMGTRKHLVPKDLDIRPVKDWELVYLQSSPDFCMKNEKVGSHGTQDR

QCNKTSNGSDSCDLMCCGRGYNPYTDRVVERCHCKYHWCCYVTCRRCERTVERYVCK

Figure 53 

GGCGCGGCAAGATGCTGGATGGGTCCCCGCTGGCGCGCTGGCTGGCCGCGGCCTTCGG 

GCTGACGCTGCTGCTCGCCGCGCTGCGCCCTTCGGCCGCCTACTTCGGGCTGACGGGCA 

GCGAGCCCCTGACCATCCTCCCGCTGACCCTGGAGCCAGAGGCGGCCGCCCAGGCGCA 

CTACAAGGCCTGCGACCGGCTGAAGCTGGAGCGGAAGCAGCGGCGCATGTGCCGCCG 

GGACCCGGGCGTGGCAGAGACGCTGGTGGAGGCCGTGAGCATGAGTGCGCTCGAGTG 

CCAGTTCCAGTTCCGCTTTGAGCGCTGGAACTGCACGCTGGAGGGCCGCTACCGGGCC 

AGCCTGCTCAAGCGAGGCTTCAAGGAGACTGCCTTCCTCTATGCCATCTCCTCGGCTGG 

CCTGACGCACGCACTGGCCAAGGCGTGCAGCGCGGGCCGCATGGAGCGCTGTACCTGC 

GATGAGGCACCCGACCTGGAGAACCGTGAGGCCTGGCAGTGGGGGGGCTGCGGAGAC 

AACCTTAAGTACAGCAGCAAGTTCGTCAAGGAATTCCTGGGCAGACGGTCAAGCAAGG 

ATCTGCGAGCCCGTGTGGACTTCCACAACAACCTCGTGGGTGTGAAGGTGATCAAGGC 

TGGGGTGGAGACCACCTGCAAGTGCCACGGCGTGTCAGGCTCATGCACGGTGCGGACC 

TGCTGGCGGCAGTTGGCGCCTTTCCATGAGGTGGGCAAGCATCTGAAGCACAAGTATG

AGACGGCACTCAAGGTGGGCAGCACCACCAATGAAGCTGCCGGCGAGGCAGGTGCCA 

TCTCCCCACCACGGGGCCGTGCCTCGGGGGCAGGTGGCAGCGACCCGCTGCCCCGCAC 

TCCAGAGCTGGTGCACCTGGATGACTCGCCTAGCTTCTGCCTGGCTGGCCGCTTCTCCC 

CGGGCACCGCTGGCCGTAGGTGCCACCGTGAGAAGAACTGCGAGAGCATCTGCTGTGG 

CCGCGGCCATAACACACAGAGCCGGGTGGTGACAAGGCCCTGCCAGTGCCAGGTGCGT 

TGGTGCTGCTATGTGGAGTGCAGGCAGTGCACGCAGCGTGAGGAGGTCTACACCTGCA 

AGGGCTGAGTTCCCAGGCCCTGCCAGCCCTGCTGCACAGGGTGCAGGCATTGCACACG 

GTGTGAAGGGTCTACACCTGCACAGGCTGAGTTCCTGGGCTCGACCAGCCCAGCTGCG 

TGGGGTACAGGCATTGCACACAGTGTGAATGGGTCTACACCTGCATGGGCTGAGTCCC 

TGGGCTCAGACCTAGCAGCGTGGGGTAGTCCCTGGGCTCAGTCCTAGCTGCATGGGGT 

GCAGGCATTGCACAGAGCATGAATGGGCCTACACCTGCCAAGGCTGAATCCCTGGGCC 

CAGCCAGCCCTGCTGCACATGGCACAGGCATTGCACACGGTGTGAGGAGTGTACACCT 

GCAAGGGCTGAGGCCCTGGGCCCAGTCAGCCCTGCTGCTCAGAGTGCAGGCATTGCAC 

ATGGTGTGAGAAGGTCTACACCTGCAAGGGACGAGTCCCCGGGCCTGGCCAACCCTGC 

TGTGCAGGGTGAGGGCCATGCATGCTAGTATGAGGGGTCTACACCTGCAAGGACTGAG 

AGGCTTTT 
Figure 54



MLDGSPLARWLAAAFGLTLLLAALRPSAAYFGLTGSEPLTILPLTLEPEAAAQAHYKACDR 
LKLERKQRRMCRRDPGVAETLVEAVSMSALECQFQFRFERWNCTLEGRYRASLLKRGFKE

TAFLYAISSAGLTHALAKACSAGRMERCTCDEAPDLENREAWQWGGCGDNLKYSSKFVK
EFLGRRSSKDLRARVDFHNNLVGVKVIKAGVETTCKCHGVSGSCTVRTCWRQLAPFHEVG
KHLKHKYETALKVGSTTNEAAGEAGAISPPRGRASGAGGSDPLPRTPELVHLDDSPSFCLA
GRFSPGTAGRRCHREKNCESICCGRGHNTQSRVVTRPCQCQVRWCCYVECRQCTQREEVY
TCKG 

Figure 55 

AGCCTGCAAAAACCACAGAGGGCAAAGCCAGAAAGATGGAAAGGCACCCACCCATGC

AGCTCACCACTTGCCTCAGGGAGACCCTCTTCACAGGGGCTTCTCAAAAGACCTCCCTA 

TGGTGGTTGGGCATTGCCTCCTTCGGGGTTCCAGAGAAGCTGGGCTGCGCCAATTTGCC 

GCTGAACAGCCGCCAGAAGGAGCTGTGCAAGAGGAAACCGTACCTGCTGCCGAGCAT 

CCGAGAGGGCGCCCGGCTGGGCATTCAGGAGTGCAGGAGCCAGTTCAGACACGAGAG 

ATGGAACTGCATGATCACCGCCGCCGCCACTACCGCCCCGATGGGCGCCAGCCCCCTC 

TTTGGCTACGAGCTGAGCAGCGGCACCAAAGAGACAGCATTTATTTATGCTGTGATGG 

CTGCAGGCCTGGTGCATTCTGTGACCAGGTCATGCAGTGCAGGCAACATGACAGAGTG 

TTCCTGTGACACCACCTTGCAGAACGGCGGCTCAGCAAGTGAAGGCTGGCACTGGGGG 

GGCTGCTCCGATGATGTCCAGTATGGCATGTGGTTCAGCAGAAAGTTCCTAGATTTCCC 

CATCGGAAACACCACGGGCAAAGAAAACAAAGTACTATTAGCAATGAACCTACATAA 

CAATGAAGCTGGAAGGCAGGCTGTCGCCAAGTTGATGTCAGTAGACTGCCGCTGCCAC 

GGAGTTTCCGGCTCCTGTGCTGTGAAAACATGGTGGAAAACCATGTCTTCTTTTGAAAA 

GATTGGCCATTTGTTGAAGGATAAATATGAAAACAGTATCCAGATATCAGACAAAATA 

AAGAGGAAAATGCGCAGGAGAGAAAAAGATCAGAGGAAAATACCAATCCATAAGGAT 

GATCTGCTCTATGTTAATAAGTCTCCCAACTACTGTGTAGAAGATAAGAAACTGGGAAT 

CCCAGGGACACAAGGCAGAGAATGCAACCGTACATCAGAGGGTGCAGATGGCTGCAA 

CCTCCTCTGCTGTGGCCGAGGTTACAACACCCATGTGGTCAGGCACGTGGAGAGGTGT 

GAGTGTAAGTTCATCTGGTGCTGCTATGTCCGTTGCAGGAGGTGTGAAAGCATGACTG 

ATGTCCACACTTGCAAGTAACCACTCCATCCAGCCTTGGGCAAGATGCCTCAGCAATAT 

ACAATGGCATTGCAACCAGAGAGGTGCCCATCCCTGTGCAGCGCTAGTAAAGTTGACT 

CTTGCAGTGGAATCCC 

Figure 56 

MDRAALLGLARECALWAALLVLFPYGAQGNWMWLGIASFGVPEKLGCANLPLNSRQKEL 

CKRETYLLPSIREGARLGIQECGSQFRHERWNCMITAAATTAPMGASPLFGYELSSGTKET

AFIYAVMAAGLVHSVTRSCSAGNMTECSCDTTLQNGGSASEGWHWGGCSDDVQYGMWF 

SRKELDFPIGNTTGKENKVLLAMNDHNNEAGRQAVAKLMSVDCRCHGVSGSCAVKTCWK

TMSSFEIHGHLLIQDKYENSR^IS 

KLGIPGTQGRECNRTSEGADGCNLLCCGRGYNTHVVRHVERCECKFIWCCYV
RCRRCESMTDVHTCK 



Figure 57 

AGTTGAGGGATTGACACAAATGGTCAGGCGGCGGCGGCGGAGAAGGAGGCGGAGGCG 
CAGGGGGGAGCCGAGCCCGCTGGGCTGCGGAGAGTTGCGCTCTCTACGGGGCCGCGGC 
CACTAGCGCGGCGCCGCCAGCCGGGAGCCAGCGAGCCGAGGGCCAGGAAGGCGGGAC

ACGACCCCGGCGCGCCCTAGCCACCCGGGTTCTCCCCGCCGCCCGCGCTTCATGAATCG 

CAAGTTTCCGCGGCGGCGGCGGCTGCGGTACGCAGAACAGGAGCCGGGGGAGCGGGC 

CGAAAGCGGCTTGGGCTCGACGGAGGGCACCCGCGCAGAGGTCTCCCTGGCCGCAGG 

GGGAGCCGCCGCCGGCCGTGCCCCTGGCAGCCCCAGCGGAGCGGCGCCAAGAGAGGA 

GCCGAGAAAGTATGGCTGAGGAGGAGGCGCCTAAGAAGTCCCGGGCCGCCGGCGGTG 

GCGCGAGCTGGGAACTTTGTGCCGGGGCGCTCTCGGCCCGGCTGGCGGAGGAGGGCAG 

CGGGGACGCCGGTGGCCGCCGCCGCCCGCCAGTTGACCCCCGGCGATTGGCGCGCCAG 

CTGCTGCTGCTGCTTTGGCTGCTGGAGGCTCCGCTGCTGCTGGGGGTCCGGGCCCAGGC 

GGCGGGCCAGGGGCCAGGCCAGGGGCCCGGGCCGGGGCAGCAACCGCCGCCGCCGCC 

TCAGCAGCAACAGAGCGGGCAGCAGTACAACGGCGAGCGGGGCATCTCCGTCCCGGA 

CCACGGCTATTGCCAGCCCATCTCCATCCCGCTGTGCACGGACATCGCGTACAACCAG 

ACCATCATGCCCAACCTGCTGGGCCACACGAACCAGGAGGACGCGGGCCTGGAGGTGC 

ACCAGTTCTACCCTCTAGTGAAAGTGCAGTGTTCCGCTGAGCTCAAGTTCTTCCTGTGC 

TCCATGTACGCGCCCGTGTGCACCGTGCTAGAGCAGGCGCTGCCGCCCTGCCGCTCCCT 

GTGCGAGCGCGCGCGCCAGGGCTGCGAGGCGCTCATGAACAAGTTCGGCTTCCAGTGG 

CCAGACACGCTCAAGTGTGAGAAGTTCCCGGTGCACGGCGCCGGCGAGCTGTGCGTGG 

GCCAGAACACGTCCGACAAGGGCACCCCGACGCCCTCGCTGCTTCCAGAGTTCTGGAC 

CAGCAACCCTCAGCACGGCGGCGGAGGGCACCGTGGCGGCTTCCCGGGGGGCGCCGG 

CGCGTCGGAGCGAGGCAAGTTCTCCTGCCCGCGCGCCCTCAAGGTGCCCTCCTACCTCA 

ACTACCACTTCCTGGGGGAGAAGGACTGCGGCGCACCTTGTGAGCCGACCAAGGTGTA 

TGGGCTCATGTAGTTCGGGCCCGAGGAGCTGCGCTTCTCGCGCACCTGGATTGGCATTT 

GGTCAGTGCTGTGCTGCGCCTCCACGCTCTTCACGGTGCTTACGTACCTGGTGGACATG 

CGGCGCTTCAGCTACCCGGAGCGGCCCATCATCTTCTTGTCCGGCTGTTACACGGCCGT 

GGCCGTGGCCTACATCGCCGGCTTCCTCCTGGAAGACCGAGTGGTGTGTAATGACAAG 

TTCGCCGAGGACGGGGCACGCACTGTGGCGCAGGGCACCAAGAAGGAGGGCTGCACC 

ATCCTCTTCATGATGCTCTACTTCTTCAGCATGGCCAGCTCCATCTGGTGGGTGATCCTG 

TCGCTCACCTGGTTCCTGGCGGCTGGCATGAAGTGGGGCCACGAGGCCATCGAAGCCA 

ACTCACAGTATTTTCACCTGGCCGCCTGGGCTGTGCCGGCCATCAAGACCATCACCATC 

CTGGCGCTGGGCCAGGTGGACGGCGATGTGCTGAGCGGAGTGTGCTTCGTGGGGCTTA 

ACAACGTGGACGCGCTGCGTGGCTTCGTGCTGGCGCCCCTCTTCGTGTACCTGTTTATC 

GGCACGTCCTTTCTGCTGGCCGGCTTTGTGTCGCTCTTCCGCATCCGCACCATCATGAA 

GCACGATGGCACCAAGACCGAGAAGCTGGAGAAGCTCATGGTGCGCATTGGCGTCTTC 

AGCGTGCTGTACACTGTGCCAGCCACCATCGTCATCGCCTGCTACTTCTACGAGCAGGC 

CTTCCGGGACCAGTGGGAACGCAGCTGGGTGGCCCAGAGCTGCAAGAGCTACGCTATC 

CCCTGCCCTCACCTCCAGGCGGGCGGAGGCGCCCCGCCGCACCCGCCCATGAGCCCGG 

ACTTCACGGTCTTCATGATTAAGTACCTTATGACGCTGATCGTGGGCATCACGTCGGGC 

TTCTGGATCTGGTCCGGCAAGACCCTCAACTCCTGGAGGAAGTTCTACACGAGGCTCA 

CCAACAGCAAACAAGGGGAGACTACAGTCTGAGACCCGGGGCTCAGCCCATGCCCAG 

GCCTCGGCCGGGGCGCAGCGATCCCCCAAAGCCAGCGCCGTGGAGTTCGTGCCAATCC 

TGACATCTCGAGGTTTCCTCACTAGACAACTCTCTTTCGCAGGCTCCTTTGAACAACTC 

AGCTCCTGCAAAAGCTTCCGTCCCTGAGGCAAAAGGACACGAGGGCCCGACTGCCAGA 

GGGAGGATGGACAGACCTCTTGCCCTCACACTCTGGTACCAGGACTGTTCGCTTTTATG 

ATTGTAAATAGCCTGTGTAAGATTTTTGTAAGTATATTTGTATTTAAATGACGACCGAT 

CACGCGTTTTTCTTTTTCAAAAGTTTTTAATTATTTAGGGCGGTTTAACCATTTGAGGCT 

TTTCCTTCTTGCCCTTTTCGGAGTATTGCAAAGGAGCTAAAACTGGTGTGCAACCGCAC 

AGCGCTCCTGGTCGTCCTCGCGCGCCTCTCCCTACCACGGGTGCTCGGGACGGCTGGGC 

GCCAGCTCCGGGGCGAGTTCAGCACTGCGGGGTGCGACTAGGGCTGCGCTGCCAGGGT 

CACTTCCCGCCTCCTCCTTTTGCCCCCTCCCCCTCCTTCTGTCCCCTCCCTTTCTTTCCTG 

GCTTGAGGTAGGGGCTCTTAAGGTACAGAACTCCACAAACCTTCCAAATCTGGAGGAG 

GGCCCCCATACATTACAATTCCTCCCTTGCTCGGCGGTGGATTGCGAAGGCCCGTCCCT 

TCGACTTCCTGAAGCTGGATTTTTAACTGTCCAGAACTTTCCTCCAACTTCATGGGGGC 
CCACGGGTGTGGGCGCTGGCAGTCTCAGCCTCCCTCCACGGTCACCTTCAACGCCCAG

ACACTCCCTTCTCCCACCTTAGTTGGTTACAGGGTGAGTGAGATAACCAATGCCAAACT 

TTTTGAAGTCTAATTTTTGAGGGGTGAGCTCATTTCATTCTCTAGTGTCTAAAACCTGGT 

ATGGGTTTGGCCAGCGTCATGGAAAGATGTGGTTACTGAGATTTGGGAAGAAGCATGA 

AGCTTTGTGTGGGTTGGAAGAGACTGAAGATATGGGTTATAAAATGTTAATTCTAATTG 

CATACGGATGCCTGGCAACCTTGCCTTTGAGAATGAGACAGCCTGCGCTTAGATTTTAC 

CGGTCTGTAAAATGGAAATGTTGAGGTCACCTGGAAAGCTTTGTTAAGGAGTTGATGTT 

TGCTTTCCTTAACAAGACAGCAAAACGTAAACAGAAATTGAAAACTTGAAGGATATTT 

CAGTGTCATGGACTTCCTCAAAATGAAGTGCTATTTTCTTATTTTTAATCAAATAACTA 

GACATATATCAGAAACTTTAAAATGTAAAAGTTGTACACTTTCAACATTTTATTACGAT 

TATTATTCAGCAGCACATTCTGAGGGGGGAACAATTCACACCACCAATAATAACCTGG 

TAAGATTTCAGGAGGTAAAGAAGGTGGAATAATTGACGGGGAGATAGCGCCTGAAAT 

AAACAAAATATGGGCATGCATGCTAAAGGGAAAATGTGTGCAGGTCTACTGCATTAAA 

TCCTGTGTGCTCCTCTTTTGGATTTACAGAAATGTGTCAAATGTAAATCTTTCAAAGCC 

ATTTAAAAATATTCACTTTAGTTCTCTGTGAAGAAGAGGAGAAAAGCAATCCTCCTGAT 

TGTATTGTTTTAAACTTTAAGAATTTATCAAAATGCCGGTACTTAGGACCTAAATTTAT 

CTATGTCTGTCATACGCTAAAATGATATTGGTCTTTGAATTTGGTATACATTTATTCTGT 

TCACTATCACAAAATCATCTATATTTATAGAGGAATAGAAGTTTATATATATATAATAC 

CATATTTTTAATTTCACAAATAAAAAATTCAAAGTTTTGTACAAAATTATATGGATTTT 

GTGCCTGAAAATAATAGAGCTTGAGCTGTCTGAACTATTTTACATTTTATGGTGTCTCA 

TAGCCAATCCCACAGTGTAAAAATTCA 
Figure 58 



MAEEEAPKKSRAAGGGASWELCAGALSARLAEEGSGDAGGRRRPPVDPBIRLARQLLLLL 

WLLEAPLLLGVRAQAAGQGPGQGPGPGQQPPPPPQQQQSGQQYNGERGISVPDHGYCQPI 

SIPLCTDIAYNQTIMPr^LGHTNQEDAGLEVHQFYPLVKVQCSAELKTFLCSMYAPVCTVL 

EQALPPCRSLCERARQGCEALMNKFGFQWPDTLKCEKFPVHGAGELCVGQNTSDKGTPTP 

SLLPEFWTSOTQHGGGGHRGGFPGGAGASERGKESCPRALKVPSYLNYHFLGEKDCGAPC 

EPTKVYGLMYFGPEELP^SRTWIGIWSVLCCASTLFTVLTYLVDMRPJFSYPERPIIFLSGCYT 

AVAVAYIAGFLLEDRWCNDKFAEDGARTVAQGTKXEGCT]I.FMMLYFFSMASSIWW 

VILSLTWLAAGMKWGHEAIEANSQYFHLAAWAVPAIKTITILALGQVDGDVLSGVCFVG 

LNNVDALRGFVLAPLFVYLFIGTSFLLAGFVSLFRmTIMKiroGTKTEKLEKXMVRIGVFSV 

LYTVPATIVIACYFYEQA 

FRDQWERSWVAQSCKSYAIPCPHLQAGGGAPPHPPMSPDFTWMIKYLMTLrV'GITSGFWI 
WSGKTLNSWRKFYTRLTNSKQGETTV



Figure 59 

CGAGTAAAGTTTGCAAAGAGGCGCGGGAGGCGGCAGCCGCAGCGAGGAGGCGGCGGG 

GAAGAAGCGCAGTCTCCGGGTTGGGGGCGGGGGCGGGGGGGGCGCCAAGGAGCCGGG 

TGGGGGGCGGCGGCCAGCATGCGGCCCCGCAGCGCCCTGCCCCGCCTGCTGCTGCCGC 

TGCTGCTGCTGCCCGCCGCCGGGCCGGCCCAGTTCCACGGGGAGAAGGGCATCTCCAT 

CCCGGACCACGGCTTCTGCCAGCCCATCTCCATCCCGCTGTGCACGGACATCGCCTACA 

ACCAGACCATCATGCCCAACCTTCTGGGCCACACGAACCAGGAGGACGCAGGCCTAGA 

GGTGCACCAGTTCTATCCGCTGGTGAAGGTGCAGTGCTCGCCCGAACTGCGCTTCTTCC 

TGTGCTCCATGTACGCACCCGTGTGCACCGTGCTGGAACAGGCCATCCCGCCGTGCCGC 

TCTATCTGTGAGCGCGCGCGCCAGGGCTGCGAAGCCCTCATGAACAAGTTCGGTTTTCA 

GTGGCCCGAGCGCCTGCGCTGCGAGCACTTCCCGCGCCACGGCGCCGAGCAGATCTGC 

GTCGGCCAGAACCACTCCGAGGACGGAGCTCCCGCGCTACTCACCACCGCGCCGCCGC 
CGGGACTGCAGCCGGGTGCCGGGGGCACCCCGGGTGGCCCGGGCGGCGGCGGCGCTC
CCCCGCGCTACGCCACGCTGGAGCACCCCTTCCACTGCCCGCGCGTCCTCAAGGTGCCA 
TCCTATCTCAGCTACAAGTTTCTGGGCGAGCGTGATTGTGCTGCGCCCTGCGAACCTGC 
GCGGCCCGATGGTTCCATGTTCTTCTCACAGGAGGAGACGCGTTTCGCGCGCCTCTGGA 
TCCTCACCTGGTCGGTGCTGTGCTGCGCTTCCACCTTCTTCACTGTCACCACGTACTTGG 
TAGACATGCAGCGCTTCCGCTACCCAGAGCGGCCTATCATTTTTCTGTCGGGCTGCTAC 
ACCATGGTGTCGGTGGCCTACATCGCGGGCTTCGTGCTCCAGGAGCGCGTGGTGTGCA 
ACGAGCGCTTCTCCGAGGACGGTTACCGCACGGTGGTGCAGGGCACCAAGAAGGAGG 
GCTGCACCATCCTCTTCATGATGCTCTACTTCTTCAGCATGGCCAGCTCCATCTGGTGG 
GTCATCCTGTCGCTCACCTGGTTCCTGGCAGCCGGCATGAAGTGGGGCCACGAGGCCA 
TCGAGGCCAACTCTCAGTACTTCCACCTGGCCGCCTGGGCCGTGCCGGCCGTCAAGAC 
CATCACCATCCTGGCCATGGGCCAGATCGACGGCGACCTGCTGAGCGGCGTGTGCTTC 
GTAGGCCTCAACAGCCTGGACCCGCTGCGGGGCTTCGTGCTAGCGCCGCTCTTCGTGTA 
CCTGTTCATCGGCACGTCCTTCCTCCTGGCCGGCTTCGTGTCGCTCTTCCGCATCCGCAC 
CATCATGAAGCACGACGGCACCAAGACCGAAAAGCTGGAGCGGCTCATGGTGCGCAT 
CGGCGTCTTCTCCGTGCTCTACACAGTGCCCGCCACCATCGTCATCGCTTGCTACTTCTA 
CGAGCAGGCCTTCCGCGAGCACTGGGAGCGCTCGTGGGTGAGCCAGCACTGCAAGAGC 
CTGGCCATCCCGTGCCCGGCGCACTACACGCCGCGCATGTCGCCCGACTTCACGGTCTA
CATGATCAAATACCTCATGACGCTCATCGTGGGCATCACGTCGGGCTTCTGGATCTGGT 
CGGGCAAGACGCTGCACTCGTGGAGGAAGTTCTACACTCGCCTCACCAACAGCCGACA 
CGGTGAGACCACCGTGTGAGGGACGCCCCCAGGCCGGAACCGCGCGGCGCTTTCCTCC 
GCCCGGGGTGGGGCCCCTACAGACTCCGTATTTTATTTTTTTAAATAAAAAACGATCGA 
AACCATTTCACTTTTAGGTTGCTTTTTAAAAGAGAACTCTCTGCCCAACACCCCC 



Figure 60 

MRPRSALPRLLLPLLLLPAAGPAQFHGEKGISIPDHGFCQPISIPLCTDIAYNQTIMPNLLGHT 

NQEDAGLEVHQFYPLVKVQCSPELRFFLCSMYAPVCTVLEQAIPPCRSICERARQGCEALM 

mEGFQWPERLRCEHFPRHGAEQICVGQNHSEDGAPALLTTAPPPGLQPGAGGTPGGPGG 

GGAPPRYATLEHPFHCPRVLKWSYLSYKFLGERDCAAPCEPARPDGSMFFSQEETRFARJL 

WILTWSVECCASTFFTVTTYLVDMQRFRYPERPIIFLSGCYTMVSVAYIAGFVLQERVVCN 

ERFSEDGYRTWQGTKKEGCTILFMMLYFFSMASSIWWILSLTWLAAGMKWGHEAIEA 

NSQYFHLAAWAVPAVKTITILAMGQIDGDLLSGVCFVGLNSLDPLRGFVLAPLFVYLFIGTS 

FLLAGFVSLFRmTIMKHDGTKTEKLERLMVRIGWSVLYTWATIVIACYFYEQAFRE^ 

ERSWVSQHCKSLAIPCPAHYTPRMSPDFTVYMIKYLMTLIVGITSGFWIWSGKTLHSWRKF

YTRLTNSRHGETTV 

Figure 61 

GCCGCTCCGGGTACCTGAGGGACGCGCGGCCGCCCGCGGCAGGCGGTGCAGCCCCCCC 

CCACCCCTTGGAGCCAGGCGCCGGGGTCTGAGGATAGCATTTCTCAAGACCTGACTTA 

TGGAGCACTTGTAACCTGAGATATTTCAGTTGAAGGAAGAAATAGCTCTTCTCCTAAGA 

TGGAATCTGTGGTTTGGGAATGTGGTTGATCAACTTGATATGTTGGCCAAATGTGCCCC 

ATGTAATAAAATGAAAAGAAGAGACAAGATGATGTCATTTTCCCATATTGTGAAACCA 

AAAACAAACGCCTTTTGTGAGACCAAGCTAACAAACCTCTGACGGTGCGAAGAGTATT 

TAACTGTTTGAAGAATTTAACAGTAAGATACAGAAGAAGTACCTTCGAGCTGAGACCT 

GCAGGTGTATAAATATCTAAAATACATATTGAATAGGCCTGATCATCTGAATCTCCTTC 

AGACCCAGGAAGGATGGCTATGACTTGGATTGTCTTCTCTCTTTGGCCCTTGACTGTGT 

TCATGGGGCATATAGGTGGGCACAGTTTGTTTTCTTGTGAACCTATTACCTTGAGGATG 

TGCCAAGATTTGCCTTATAATACTACCTTCATGCCTAATCTTCTGAATCATTATGACCAA 

CAGACAGCAGCTTTGGCAATGGAGCCATTCCACCCTATGGTGAATCTGGATTGTTCTCG 
ggatttccggccttttctttgtgcactctacgctcctatttgtatggaatatggacgtgt

cacacttccctgtcgtaggctgtgtcagcgggcttacagtgagtgttcgaagctcatgg 

agatgtttggtgttccttggcctgaagatatggaatgcagtaggttcccagattgtgat 

gagccatatcctcgacttgtggatctgaatttagctggagaaccaactgaaggagccc 

cagtggcagtgcagagagactatggtttttggtgtccccgagagttaaaaattgatcct 

gatctgggttattcttttctgcatgtgcgtgattgttcacctccttgtccaaatatgtac 

ttcagaagagaagaactgtcatttgctcgctatttcataggattgatttcaatcatttg 

cctctcggccacattgtttacttttttaacttttttgattgatgtcacaagattccgtta 

tcctgaaaggcctattatattttatgcagtctgctacatgatggtatccttaattttctt 

cattggatttttgcttgaagatcgagtagcctgcaatgcatccatccctgcacaatata 

aggcttccacagtgacacaaggatctcataataaagcctgtaccatgctttttatgata 

ctctatttttttactatggctggcagtgtatggtgggtaattcttaccatcacatggttt 

ttagcagctgtgccaaagtggggtagtgaagctattgagaagaaagcattgctgtttc 

acgccagtgcatggggcatccccggaactctaaccatcatccttttagcgatgaataa 

aattgaaggtgacaatattagtggcgtgtgttttgttggcctctacgatgttgatgcat 

tgagatattttgttcttgctcccctctgcctgtatgtggtagttggggtttctctcctctt 

agctggcAttatatccctaaacagagttcgaattgagattccattagaaaaggagaac 

caagataaattagtgaagtttatgatccggatcggtgttttcagcattctttatctcgt 

accactcttggttgtaattggatgctacttttatgagcaagcttaccggggcatctggg 

aaacaacgtggatacaagaacgctgcagagaatatcacattccatgtccatatcaggt 

tactcaaatgagtcgtccagacttgattctctttctgatgaaatacctgatggctctca 

tagttggcattccctctgtattttgggttggaagcaaaaagacatgctttgaatgggcc 

agtttttttcatggtcgtaggaaaaaagagatagtgaatgagagccgacaggtactcc 

aggaacctgattttgctcagtctctcctgagggatccaaatactcctatcataagaaag 

tcaaggggaacttccactcaaggaacatccacccatgcttcttcaactcagctggctat 

ggtggatgatcaaagaagcaaagcaggaagcatccacagcaaagtgagcagctacca 

cggcagcctccacagatcacgtgatggcaggtacacgccctgcagttacagaggaatg 

gaggagagactacctcatggcagcatgtcacgactaacagatcactccaggcatagta 

gttctcatcggctcaatgaacagtcacgacatagcagcatcagagatctcagtaataa 

tcccatgactcatatcacacatggcaccagcatgaatcgggttattgaagaagatgga 

accagtgcttaatttgtcttgtctaaggtggaaatcttgtgctgtttaaaaagcagatt 

ttattctttgccttttgcatgactgatagctgtactcacagttaacatgctttcagtcaa 

gtacagattgtgtccactggaaaggtaaatgattgcttttttatattgcatcaaacttg 

gaacatcaaggcatccaaaacactaagaattctatcatcacaaaaataattcgtctttc 

taggttatgaagagataattatttgtctggtaagcatttttataaacccactcattttat 

atttagaaaaatcctaaatgtgtggtgactgctttgtagtgaactttcatatactataa 

actagttgtgagataacattctggtagctcagttaataaaacaatttcagaattaaag 

aaattttctatgcaaggtttacttctcagatgaacagtaggactttgtagttttatttcc 

actaagtgaaaaaagaactgtgtttttaaactgtaggagaatttaataaatcagcaag 

ggtattttagctaatagaataaaagtgcaacagaagaatttgattagtctatgaaagg 

ttctcttaaaattctatcgaaataatcttcatgcagagatattcagggtttggattagc 

agtggaataaagagAtgggcattgtttcccctataattgtgctgtttttataacttttgt 

aaatattactttttctggctgtgtttttataacttatccatatgcatgatggaaaaattt 

taatttgtagccatcttttcccatgtaatagtattgattcatagagaacttaatgttcaa 

aatttgctttgtggaggcatgtaataagataaacatcatacattataaggtaaccaca 

attacaaaatggcaaaaca 

Figure 62 



MAMTWIWSLWPLTWMGHIGGHSLF 

AMEPFHPMVNLDCSI^FRPFLCALYAPICMEYGRVTLPCRSLCQRAYSECSKLMEMFGVP 
WTEDMECSRITDCDEPYPRLVDLNLAGEPTEGAPVAVQIUDYGFWCPRELKlDPDLGYSFL

HVRDCSPPCPNMYFRREELSFARYFIGL 

MMYSLIFFIGFLLEDRVACNASIPAQY]^^ 

LTITWFLAAWKWGSEAffiKKALLFHASAWGIPGTLTIILLAMNKffiGDNISGVCFVGLYDV 

DALRYWLAPLCLYVWGVSLLLAGnSLNRVMEIPLEKENQDKXVKFMIRIGVFSILYLVPL 

LVVIGCYFYEQAYRGIWETTWIQERCREYHIPCPYQVTQMSRPDLILFLMKYLMALIVGIPS 

WWGSKXTCFEWASFFHGPJRKICEIV^ffiSRQVLQEPDFAQSLLRDPNTPIIRKSRGTSTQGT 

STHASSTQLAMVDDQRSKAGSIHSKVSSYHGSLHRSRDGRYTPCSYRGMEERLPHGSMSR 

LTDHSRHSSSHRLI^QSRHSSIRDLSNNPMTHITHGTSMNRVIEEDGTSA 

Figure 63 

GCTGCGCAGCGCTGGCTGCTGGCTGGCCTCGCGGAGACGCCGAACGGACGCGGCCGGC 

GCCGGCTTGTGGGCTCGCCGCCTGCAGCCATGACCCTCGCAGCCTGTCCCTCGGCCTCG 

GCCCGGGACGTCTAAAATCCCACACAGTCGCGCGCAGCTGCTGGAGAGCCGGCCGCTG 

CCCCCTCGTCGCCGCATCACACTCCCGTCCCGGGAGCTGGGAGCAGCGCGGGCAGCCG 

GCGCCCCCGTGCAAACTGGGGGTGTCTGCCAGAGCAGCCCCAGCCGCTGCCGCTGCTA 

CCCCCGATGCTGGCCATGGCCTGGCGGGGCGCAGGGCCGAGCGTCCCGGGGGCGCCCG 

GGGGCGTCGGTCTCAGTCTGGGGTTGCTCCTGCAGTTGCTGCTGCTCCTGGGGCCGGCG 

CGGGGCTTCGGGGACGAGGAAGAGCGGCGCTGCGACCCCATCCGCATCTCCATGTGCC 

AGAACCTCGGCTACAACGTGACCAAGATGCCCAACCTGGTTGGGCACGAGCTGCAGAC 

GGACGCCGAGCTGCAGCTGACAACTTTCACACCGCTCATCCAGTACGGCTGCTCCAGC 

CAGCTGCAGTTCTTCCTTTGTTCTGTTTATGTGCCAATGTGCACAGAGAAGATCAACAT 

CCCCATTGGCCCATGCGGCGGCATGTGTCTTTCAGTCAAGAGACGCTGTGAACCCGTCC 

TGAAGGAATTTGGATTTGCCTGGCCAGAGAGTCTGAACTGCAGCAAATTCCCACCACA 

GAACGACCACAACCACATGTGCATGGAAGGGCCAGGTGATGAAGAGGTGCCCTTACCT 

CACAAAACCCCCATCCAGCCTGGGGAAGAGTGTCACTCTGTGGGAACCAATTCTGATC 

AGTACATCTGGGTGAAAAGGAGCCTGAACTGTGTGCTCAAGTGTGGCTATGATGCTGG 

CTTATACAGCCGCTCAGCCAAGGAGTTCACTGATATCTGGATGGCTGTGTGGGCCAGCC 

TGTGTTTCATCTCCACTGCCTTCACAGTACTGACCTTCCTGATCGATTCTTCTAGGTTTT 

CCTACCCTGAGCGCCCCATCATATTTCTCAGTATGTGCTATAATATTTATAGCATTGCTT 

ATATTGTCAGGCTGACTGTAGGCCGGGAAAGGATATCCTGTGATTTTGAAGAGGCAGC 

AGAACCTGTTCTCATCCAAGAAGGACTTAAGAACACAGGATGTGCAATAATTTTCTTGC 

TGATGTACTTTTTTGGAATGGCCAGCTCCATTTGGTGGGTTATTCTGACACTCACTTGGT 

TTTTGGCAGCAGGACTCAAATGGGGTCATGAAGCCATTGAAATGCACAGCTCTTATTTC 

CACATTGCAGCCTGGGCCATCCCCGCAGTGAAAACCATTGTCATCTTGATTATGAGACT 

GGTGGATGCAGATGAACTGACTGGCTTGTGCTATGTTGGAAACCAAAATCTCGATGCC 

CTCACCGGGTTCGTGGTGGCTCCCCTCTTTACTTATTTGGTCATTGGAACTTTGTTCATT 

GCTGCAGGTTTGGTGGCCTTGTTCAAAATTCGGTCAAATCTTCAAAAGGATGGGACAA 

AGACAGACAAGTTAGAAAGACTGATGGTCAAGATTGGGGTGTTCTCAGTACTGTACAC 

AGTTCCTGCAACGTGTGTGATTGCCTGTTATTTTTATGAAATCTCCAACTGGGCACTTTT 

TCGGTATTCTGCAGATGATTCCAACATGGCTGTTGAAATGTTGAAAACTTTTATGTCTTT 

GTTGGTGGGCATCACTTCAGGCATGTGGATTTGGTCTGCCAAAAGTCTTCACACGTGGC 

AGAAGTGTTCCAACAGATTGGTGAATTCTGGAAAGGTAAAGAGAGAGAAGAGAGGAA 

ATGGTTGGGTGAAGCCTGGAAAAGGCAGTGAGACTGTGGTATAAGGCTAGTCAGCCTC 

CATGCTTTCTTCATTTTGAAGGGGGGAATGCCAGCATTTTGGAGGAAATTCTACTAAAA 

GTTTTATGCAGTGAATCTCAGTTTGAACAAACTAGCAACAATTAAGTGACCCCCGTCAA 

CCCACTGCCTCCCACCCCGACCCCAGCATCAAAAAACCAATGATTTTGCTGCAGACTTT 

GGAATGATCCAAAATGGAAAAGCCAGTTAGAGGCTTTCAAAGCTGTGAAAAATCAAA 

ACGTTGATCACTTTAGCAGGTTGCAGCTTGGAGCGTGGAGGTCCTGCCTAGATTCCAGG 

AAGTCCAGGGCGATACTGTTTTCCCCTGCAGGGTGGGATTTGAGCTGTGAGTTGGTAAC 

TAGCAGGGAGAAATATTAACTTTTTTAACCCTTTACCATTTTAAATACTAACTGGGTCT 
ttcagatagcaaagcaatctataaacactggaaacgctgggttcagaaaagtgttaca

agagttttatagtttggctgatgtaacataaacatcttctgtggtgcgctgtctgctgtt 

tagaactttgtggactgcactcccaagaagtggtgttagaatctttcagtgcctttgtc

ataaaacagttatttgaacaaacaaaagtactgtactcacacacataaggtatccagt 

ggatttttcttctctgtcttcctctcttaaatttcaacatctctcttcttggctgctgctg 

ttttcttcattttatgttaatgactcaaaaaaggtatttttatagaatttttgtactgca 

gcatgcttaaagaggggaaaaggaagggtgattcactttctgacaatcacttaattca 

gaggaaaatgagatttactaagttgacttacctgacggaccccagagacctattgcat 

tgagcagtggggacttaatatattttacttgtgtgattgcatctatgcagacgccagtc 

tggaagagctgaaatgttaagtttcttggcaactttgcattcacacagattagctgtgt 

aatttttgtgtgtcaattacaattaaaagcacattgttggaccatgacatagtatactc 

aactgactttaaaactatggtcaacttcaacttgcattctcagaatgatagtgccttta 

aaatttttttattttttaaagcataagaatgttatcagaatctggtctacttaggacaa 

tggagactttttcagttttataaagggaactgaggacagctaatccaactacttggtgc 

tgtaattgtttcctagtaattggcaaaggctccttgtaagatttcactggaggcagtgt 

ggcctggagtatttatatggtgcttaatgaatctccagaatgccagccagaagcctgat 

tggttagtagggaataaagtgtagaccatatgaaatgaactgcaaactctaatagccc 

aggtcttaattgcctttagcagaggtatccaaagcttttaaaatttatgcatacgttct 

tcacaagggggtacccccagcagcctctcgaaaattgcacttctcttaaaactgtaact 

ggcctttctcttaccttgccttaggccttctaatcatgagatcttggggacaAattgact 

atgtcacaggttgctctccttgtaactcatacctgtctgcttcagcaactgctttgcaat 

gacatttatttattaattcatgccttaaaaaaataggaagggaagcttttttttttcttt 

ttttttttttcaatcacactttgtggaaaaacatttccagggactcaaaattccaaaaa 

ggtggtcaaattctggaagtaagcatttcctcttttttaaaaatttggtttgagccttat 

gcccatagtttgacatttccctttcttctttcctttttgtttttgtgtggttcttgagctc 

tctgacatcaagatgcatgtaaagtcgattgtatgttttgaaggcaaagtcttggcttt 

tgagactgaagttaagtgggcacaggtggcccctgctgctgtgcccagtctgagtacc 

ttggctagactctaggtcaggctccaggagcatgagaattgatccccagaagaaccat 

tttaactccatctgatactccattgcctatgaaatgtaaaatgtgaactccctgtgctg 

cttgtagacagttcccataactgtccacggccctggagcacgcacccaggggcagagc 

ctgcccttactcacgctctgctctggtgtcttgggagttgtgcagggactctggcccag 

gcaggggaaggaagaccaggcggtaggggactggtcttgctgttagagtatagaggtt 

tgtaatgcagttttcttcataatgtgtcagtgattgtgtgaccaaggcagcatctagca 

gaaagccaggcatggagtaggtgatcgatacttgtcaatgactaaataataacaataa 

aagagcacttgggtgaatctgggcacctgatttctgagttttgagttctggagctagtg 

ttttgacaatgctttgggttttgacatgccttttccacaaatctcttgccttttcagggc 

aaagtgtatttgatcagaagtggccatttggattagtagccttagcaatgctacagggt 

tataggcccctctccctttcacattccagacaatggagagtgtttatggtttcaggaaa 

agaactttgtggctgaggggtcagttaccagtgaccttcaatcaactccatcacttctt 

aaatcggtatttgttaaaaaaatcagttattttatttattgagtgccgactgtagtaaa 

gccctgaaatagataatctctgttcttctaactgatctaggatggggacgcacccaggt 

ctgctgaactttactgttcctctgggaaaggagcagggacctctggaattcccatctgt 

ttcactgtctccattccataaatctcttcctgtgtgagccaccacacccagcctgggtct 

ctctacttttaacacatctctcatccctttcccaggacttccttccaagtcagttacagg 

tggttttaacagaaagcatcagctctgcttcgtgacagtctctggagaaatcccttagg 

aagactatgagagtaggccacaaggacatgggcccacacatctgctttggctttgccg 

gcaattcagggcttggggtattccatgtgacttgtataggtatatttgaggacagcatc 

ttgctagagaaaaggtgagggttgtttttctttctctgaaacctacagtaaatgggtat 

gattgtagcttcctcagaaatcccttggcctccagagattaaacatggtgcaatggcac 

ctctgtccaacctcctttctggtagattcctttctcctgcttcatataggccaaacctca 

gggcaagggaacatgggggtagagtggtgctggccagaaccatctgcttgagctactt 

ggttgattcatatcctctttcctttatggagacccatttcctgatctctgagactgttgc 
TGAACTGGCAACTTACTTGGGCCTGAAACTGGAGAAGGGGTGACATTTTTTTAATTTCA

GAGATGCTTTCTGATTTTCCTCTCCCAGGTCACTGTCTCACCTGCACTCTCCAAACTCAG 

GTTCCGGGAAGCTTGTGTGTCTAGATACTGAATTGAGATTCTGTTCAGCACCTTTTAGC 

TCTATACTCTCTGGCTCCCCTCATCCTCATGGTCACTGAATTAAATGCTTATTGTATTGA 

GAACCAAGATGGGACCTGAGGACACAAAGATGAGCTCAACAGTCTCAGCCCTAGAGG 

AATAGACTCAGGGATTTCACCAGGTCGGTGCAGTATTTGATTTCTGGTGAGGTGACCAC 

AGCTGCAGTTAGGAAGGGAGCCATTGAGCACAGACTTTGGAAGGAACCTTTTTTTTGTT 

GTTTGTTTGTTTGTTTGTTTGTTTGTTTGTTTGAGACAGGGTCTTGCTCTGCTACCCAGG 

CTGGGGCGCAATGGCACGATCTTGGCTCACTGCAACCTCTGCCTCCTGGGTTCAAGTGA 

TTCTCCTGCCACAGCCTCCTGAGGAGCTGGGACTACAGGTGCGTGCTACCACGCCCAG 

CTACTTCTGTATTTTTAGTAGAGACGGGGTTTCACTGTGTTGGCCAGGCTGGTCTCGAA 

CTCCTGACCTCATGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAAGTGTGA 

GCCACCACACCTGGCCTGGAAGGAACCTCTTAAAATCAGTTTACGTCTTGTATTTTGTT 

CTGTGATGGAGGACACTGGAGAGAGTTGCTATTCCAGTCAATCATGTCGAGTCACTGG 

ACTCTGAAAATCCTATTGGTTCCTTTATTTTATTTGAGTTTAGAGTTCCCTTCTGGGTTT

GTATTATGTCTGGCAAATGACCTGGGTTATCACTTTTCCTCCAGGGTTAGATCATAGAT 

CTTGGAAACTCCTTAGAGAGCATTTTGCTCCTACCAAGGATCAGATACTGGAGCCCCAC 

ATAATAGATTTCATTTCACTCTAGCCTACATAGAGCTTTCTGTTGCTGTCTCTTGCCATG 

CACTTGTGCGGTGATTACACACTTGACAGTACCAGGAGACAAATGACTTACAGATCCC 

CCGACATGCCTCTTCCCCTTGGCAAGCTCAGTTGCCCTGATAGTAGCATGTTTCTGTTTC 

TGATGTACCTTTTTTCTCTTCTTCTTTGCATCAGCCAATTCCCAGAATTTCCCCAGGCAA 

TTTGTAGAGGACCTTTTTGGGGTCCTATATGAGCCATGTCCTCAAAGCTTTTAAACCTC 

CTTGCTCTCCTACAATATTCAGTACATGACCACTGTCATCCTAGAAGGCTTCTGAAAAG 

AGGGGCAAGAGCCACTCTGCGCCACAAAGGTTGGATCCATCTTCTCTCCGAGGTTGTG 

AAAGTTTTCAAATTGTACTAATAGGCTGGGGCCCTGACTTGGCTGTGGGCTTTGGGAGG 

GGTAAGCTGCTTTCTAGATCTCTCCCAGTGAGGCATGGAGGTGTTTCTGAATTTTGTCT 

ACCTCACAGGGATGTTGTGAGGCTTGAAAAGGTCAAAAAATGATGGCCCCTTGAGCTC 

TTTGTAAGAAAGGTAGATGAAATATCGGATGTAATCTGAAAAAAAGATAAAATGTGAC 

TTCCCCTGCTCTGTGCAGCAGTCGGGCTGGATGCTCTGTGGCNTTTCTTGGGTCCTCATG 

CCACCCCACAGCTCCAGGAACCTTGAAGCCAATCTGGGGACTTTCAGATGTTTGACAA 

AGAGGTACCAGGCAAACTTCCTGCTACACATGCCCTGAATGAATTGCTAAATTTCAAA 

GGAAATGGACCCTGCTTTTAAGGATGTACAAAAGTATGTCTGCATCGATGTCTGTACTG 

TAAATTTCTAATTTATCACTGTACAAAGAAAACCCCTTGCTATTTAATTTTGTATTAAAG 

GAAAATAAAGTTTTGTTTGTTAAAAAAAAA 



Figure 64 



MAWRGAGPSWGAPGGVGLSLGLLLQLLLLLGPARGFGDEEERRCDPIEISMCQNLGYNV 

TKMPNLVGHELQTDAELQLTTFTPLIQYGCSSQLQFFLCSVYVPMCTEKINIPIGPCGGMCL
SVKI^CEPVLKEFGFAWPESLNCSKEPPQm)HNHMCMEGPGDEEVPLPHKTPIQPGEECHS 
VGTNSDQYlWA^CRSLNCVLKCGYDAGLYSRSAKEFTDrWMAVWASLCFISTAFTVLTFLID 
SSP^SYPERPEFLSMCYNTYSIAYrVIG^WGRERISCDFEEAAEPVLIQEGLKNT^ 

YFFGMASSIWWVILTLTWFLAAGLKWGHEAIEMHSSYFHIAA 

DELTGLCYVGNQNLDALTGFWAPLFTYLVIGTLFIAAGLVAEFKIRSNLQKDGTKTDKLE 

PvEMVEJGWSVLYTWATCVIACYFYEISNWAEFRYSADDSNMAVE 
SGMWIWSAKSLHTWQKCSNRLVNSGKVKREKRGNGWVKPGKGSETVV 
Figure 65



ACCCAGGGACGGAGGACCCAGGCTGGCTTGGGGACTGTCTGCTCTTCTCGGCGGGAGC 

CGTGGAGAGTCCTTTCCCTGGAATCCGAGCCCTAACCGTCTCTCCCCAGCCCTATCCGG 

CGAGGAGCGGAGCGCTGCCAGCGGAGGCAGCGCCTTCCCGAAGCAGTTTATCTTTGGA 

CGGTTTTCTTTAAAGGAAAAACGAACCAACAGGTTGCCAGCCCCGGCGCCACACACGA 

GACGCCGGAGGGAGAAGCCCCGGCCCGGATTCCTCTGCCTGTGTGCGTCCCTCGCGGG 

CTGCTGGAGGCGAGGGGAGGGAGGGGGCGATGGCTCGGCCTGACCCATCCGCGCCGC 

CCTCGCTGTTGCTGCTGCTCCTGGCGCAGCTGGTGGGCCGGGCGGCCGCCGCGTCCAA 

GGCCCCGGTGTGCCAGGAAATCACGGTGCCCATGTGCCGCGGCATCGGCTACAACCTG 

ACGCACATGCCCAACCAGTTCAACCACGACACGCAGGACGAGGCGGGCCTGGAGGTG 

CACCAGTTCTGGCCGCTGGTGGAGATCCAATGCTCGCCGGACCTGCGCTTCTTCCTATG 

CACTATGTACACGCCCATCTGTCTGCCCGACTACCACAAGCCGCTGCCGCCCTGCCGCT 

CGGTGTGCGAGCGCGCCAAGGCCGGCTGCTCGCCGCTGATGCGCCAGTACGGCTTCGC 

CTGGCCCGAGCGCATGAGCTGCGACCGCCTCCCGGTGCTGGGCCGCGACGCCGAGGTC 

CTCTGCATGGATTACAACCGCAGCGAGGCCACCACGGCGCCCCCCAGGCCTTTCCCAG 

CCAAGCCCACCCTTCCAGGCCCGCCAGGGGCGCCGGCCTCGGGGGGCGAATGCCCCGC 

TGGGGGCCCGTTCGTGTGCAAGTGTCGCGAGCCCTTCGTGCCCATTCTGAAGGAGTCAC 

ACCCGCTCTACAACAAGGTGCGGACGGGCCAGGTGCCCAACTGCGCGGTACCCTGCTA 

CCAGCCGTCCTTCAGTGCCGACGAGCGCACGTTCGCCACCTTCTGGATAGGCCTGTGGT 

CGGTGCTGTGCTTCATCTCCACGTCCACCACAGTGGCCACCTTCCTCATCGACATGGAC 

ACGTTCCGCTATCCTGAGCGCCCCATCATCTTCCTGTCAGCCTGCTACCTGTGCGTGTC 

GCTGGGCTTCCTGGTGCGTCTGGTCGTGGGCCATGCCAGCGTGGCCTGCAGCCGCGAG 

CACAACCAGATCCACTACGAGACCACGGGCCCTGCACTGTGCACCATCGTCTTCCTCCT 

GGTCTACTTCTTCGGCATGGCCAGCTCCATCTGGTGGGTCATCCTGTCGCTCACCTGGTT 

CCTGGCCGCCGCGATGAAGTGGGGCAACGAGGCCATCGCGGGCTACGGCCAGTACTTC 

CACCTGGCTGCGTGGCTCATCCCCAGCGTCAAGTCCATCACGGCACTGGCGCTGAGCTC 

CGTGGACGGGGACCCAGTGGCCGGCATCTGCTACGTGGGCAACCAGAACCTGAACTCG 

CTGCGGCGCTTCGTGCTGGGCCCGCTGGTGCTCTACCTGCTGGTGGGCACGCTCTTCCT 

GCTGGCGGGCTTCGTGTCGCTCTTCCGCATCCGCAGCGTCATCAAGCAGGGCGGCACC 

AAGACGGACAAGCTGGAGAAGCTCATGATCCGCATCGGCATCTTCACGCTGCTCTACA 

CGGTCCCCGCCAGCATTGTGGTGGCCTGCTACCTGTACGAGCAGCACTACCGCGAGAG 

CTGGGAGGCGGCGCTCACCTGCGCCTGCCCGGGCCACGACACCGGCCAGCCGCGCGCC 

AAGCCCGAGTACTGGGTGCTCATGCTCAAGTACTTCATGTGCCTGGTGGTGGGCATCAG 

GTCGGGCGTCTGGATCTGGTCGGGCAAGACGGTGGAGTCGTGGCGGCGTTTCACCAGC 

CGCTGCTGCTGCCGCCCGCGGCGCGGCCACAAGAGCGGGGGCGCCATGGCCGCAGGG 

GACTACCCCGAGGCGAGCGCCGCGCTCACAGGCAGGACCGGGCCGCCGGGCCCCGCC 

GCCACCTACCACAAGCAGGTGTCCCTGTCGCACGTGTAGGAGGCTGCCGCCGAGGGAC 

TCGGCCGGAGAGCTGAGGGGAGGGGGGCGTTTTGTTTGGTAGTTTTGCCAAGGTCACT 

TCCGTTTACCTTCATGGTGCTGTTGCCCCCTCCCGCGGCGACTTGGAGAGAGGGAAGAG 

GGGCGTTTTCGAGGAAGAACCTGTCCCAGGTCTTCTCCAAGGGGCCCAGCTCACGTGT 

ATTCTATTTTGCGTTTCTTACCTGCCTTCTTTATGGGAACCCTCTTTTTAATTTATATGTA 

T 



Figure 66 
MARPDPSAPPSLLLLLLAQLVGRAAAASKAPVCQEITWMCRGIGYNLTHMPNQFNHDTQ

DEAGLEVHQFWLVEIQCSPDLRFFLCTMYTPICLPDYHKPLPPCRSVCERAKAGCSPLMR 

QYGFAWPEPJMSCDPJLPVLGRDAEVLCMDYNRSEATTAPPRPFPAKPTLPGPPGAPASGGE 

CPAGGPFVCKCREPFVPILKESHPLYMCVRTGQVPNCAVPCYQPSFSADERTFATFWIGLW 

SVLCFISTSTTVATFLroMDTFRYPERPDFLSACYLCVSLGFLVRLVVGHASVACSREHNHIH 

YETTGPALCTIVFLLVYFFGMASSrWl^ILSLTWLAAAMKWGNEAIAGYGQYFHLAAWX 

1PSVKSITALALSSVDGDPVAGICYVGNQKLNSLRRFVLGPLVLYLLVGTLFLLAGFVSLFRI 

RSVIKQGGTKTDKLEKLMIRIG]FTLLYTWAS]WACYLYEQHYRESWEAALTCACPGHD 

TGQPRAKPEYWLMLKYFMCLWGITSGVWrWSGKTVESWRRFTSRCCCRPRRGHKSGG 

AMAAGDYPEASAALTGRTGPPGPAATYHKQVSLSHV 



Figure 67 

GCAGCTCCAGTCCCGGACGCAACCCCGGAGCCGTCTCAGGTCCCTGGGGGGAACGGTG 

GGTTAGACGGGGACGGGAAGGGACAGCGGCCTTCGACCGCCCCCCGAGTAATTGACCC 

AGGACTCATTTTCAGGAAAGCCTGAAAATGAGTAAAATAGTGAAATGAGGAATTTGAA 

CATTTTATCTTTGGATGGGGATCTTCTGAGGATGCAAAGAGTGATTCATCCAAGCCATG 

TGGTAAAATCAGGAATTTGAAGAAAATGGAGATGTTTACATTTTTGTTGACGTGTATTT 

TTCTACCCCTCCTAAGAGGGCACAGTCTCTTCACCTGTGAACCAATTACTGTTCCCAGA 

TGTATGAAAATGGCCTACAACATGACGTTTTTCCCTAATCTGATGGGTCATTATGACCA 

GAGTATTGCCGCGGTGGAAATGGAGCATTTTCTTCCTCTCGCAAATCTGGAATGTTCAC 

CAAACATTGAAACTTTCCTCTGCAAAGCATTTGTACCAACCTGCATAGAACAAATTCAT 

GTGGTTCCACCTTGTCGTAAACTTTGTGAGAAAGTATATTCTGATTGCAAAAAATTAAT 

TGACACTTTTGGGATCCGATGGCCTGAGGAGCTTGAATGTGACAGATTACAATACTGTG 

ATGAGACTGTTCCTGTAACTTTTGATCCACACACAGAATTTCTTGGTCCTCAGAAGAAA 

ACAGAACAAGTCCAAAGAGACATTGGATTTTGGTGTCCAAGGCATCTTAAGACTTCTG 

GGGGACAAGGATATAAGTTTCTGGGAATTGACCAGTGTGCGCCTCCATGCCCCAACAT 

GTATTTTAAAAGTGATGAGCTAGAGTTTGCAAAAAGTTTTATTGGAACAGTTTCAATAT 

TTTGTCTTTGTGCAACTCTGTTCACATTCCTTACTTTTTTAATTGATGTTAGAAGATTCA 

GATACCCAGAGAGACCAATTATATATTACTCTGTCTGTTACAGCATTGTATCTCTTATG 

TACTTCATTGGATTTTTGCTGGGCGATAGCACAGCCTGCAATAAGGCAGATGAGAAGC 

TAGAACTTGGTGACACTGTTGTCCTAGGCTCTCAAAATAAGGCTTGCACCGTTTTGTTC 

ATGCTTTTGTATTTTTTCACAATGGCTGGCACTGTGTGGTGGGTGATTCTTACCATTACT 

TGGTTCTTAGCTGCAGGAAGAAAATGGAGTTGTGAAGCCATCGAGCAAAAAGCAGTGT 

GGTTTCATGCTGTTGCATGGGGAACACCAGGTTTCCTGACTGTTATGCTTCTTGCTCTGA 

ACAAAGTTGAAGGAGACAACATTAGTGGAGTTTGCTTTGTTGGCCTTTATGACCTGGAT 

GCTTCTCGCTACTTTGTACTCTTGCCACTGTGCCTTTGTGTGTTTGTTGGGCTCTCTCTTC 

TTTTAGCTGGCATTATTTCCTTAAATCATGTTCGACAAGTCATACAACATGATGGCCGG 

AACCAAGAAAAACTAAAGAAATTTATGATTCGAATTGGAGTCTTCAGCGGCTTGTATC 

TTGTGCCATTAGTGACACTTCTCGGATGTTACGTCTATGAGCAAGTGAACAGGATTACC 

TGGGAGATAACTTGGGTGTCTGATCATTGTCGTCAGTACCATATCCCATGTCCTTATCA 

GGCAAAAGCAAAAGCTCGACCAGAATTGGCTTTATTTATGATAAAATACCTGATGACA 

TTAATTGTTGGCATCTCTGCTGTCTTCTGGGTTGGAAGCAAAAAGACATGCACAGAATG 

GGCTGGGTTTTTTAAACGAAATCGCAAGAGAGATCCAATCAGTGAAAGTCGAAGAGTA 

CTACAGGAATCATGTGAGTTTTTCTTAAAGCACAATTCTAAAGTTAAACACAAAAAGA 

AGCACTATAAACCAAGTTCACACAAGCTGAAGGTCATTTCCAAATCCATGGGAACCAG 

CACAGGAGCTACAGCAAATCATGGCACTTCTGCAGTAGCAATTACTAGCCATGATTAC 

CTAGGACAAGAAACTTTGACAGAAATCCAAACCTCACCAGAAACATCAATGAGAGAG 

GTGAAAGCGGACGGAGCTAGCACCCCCAGGTTAAGAGAACAGGACTGTGGTGAACCT 

GCCTCGCCAGCAGCATCCATCTCCAGACTCTCTGGGGAACAGGTCGACGGGAAGGGCC 

AGGCAGGCAGTGTATCTGAAAGTGCGCGGAGTGAAGGAAGGATTAGTCCAAAGAGTG 
ATATTACTGACACTGGCCTGGCACAGAGCAACAATTTGCAGGTCCCCAGTTCTTCAGAA

CCAAGCAGCCTCAAAGGTTCCACATCTCTGCTTGTTCACCCAGTTTCAGGAGTGAGAAA 

AGAGCAGGGAGGTGGTTGTCATTCAGATACTTGAAGAACATTTTCTCTCGTTACTCAGA 

AGCAAATTTGTGTTACACTGGAAGTGACCTATGCACTGTTTTGTAAGAATCACTGTTAC 

GTTCTTCTTTTGCACTTAAAGTTGCATTGCCTACTGTTATACTGGAAAAAATAGAGTTC 

AAGAATAATATGACTCATTTCACACAAAGGTTAATGACAACAATATACCTGAAAACAG 

AAATGTGCAGGTTAATAATATTTTTTTAATAGTGTGGGAGGACAGAGTTAGAGGAATC 

TTCCTTTTCTATTTATGAAGATTCTACTCTTGGTAAGAGTATTTTAAGATGTACTATGCT 

ATTTTACCTTTTTGATATAAAATCAAGATATTTCTTTGCTGAAGTATTTAAATCTTATCC 

TTGTATCTTTTTATACATATTTGAAAATAAGCTTATATGTATTTGAACTTTTTTGAAATC 

CTATTCAAGTATTTTTATCATGCTATTGTGATATTTTAGCACTTTGGTAGCTTTTACACT 

GAATTTCTAAGAAAATTGTAAAATAGTCTTCTTTTATACTGTAAAAAAAGATATACCAA 

AAAGTCTTATAATAGGAATTTAACTTTAAAAACCCACTTATTGATACCTTACCATCTAA 

AATGTGTGATTTTTATAGTCTCGTTTTAGGAATTTCACAGATCTAAATTATGTAACTGA 

AATAAGGTGCTTACTCAAAGAGTGTCCACTATTGATTGTATTATGCTGCTCACTGATCC 

TTCTGCATATTTAAAATAAAATGTCCTAAAGGGTTAGTAGACAAAATGTTAGTCTTTTG 

TATATTAGGCCAAGTGCAATTGACTTCCCTTTTTTAATGTTTCATGACCACCCATTGATT 

GTATTATAACCACTTACAGTTGCTTATATTTTTTGTTTTAACTTTTGTTTCTTAACATTTA 

GAATATTACATTTTGTATTATACAGTACCTTTCTCAGACATTTTGTAG 



Figure 68 



MEMFTFLLTCJFLPLLRGHSLFTCEPIT^ 
LPLAMJ3CSPNIETFLCILfy?W 

RLQYCDETVPVTFDPHTEFLGPQKKTEQVQRDIGFWCPRHLKTSGGQGYKFLGIDQCAPPC
PNMYFKSDELEFAKSFIGTVSIFCLCATLFTFLTFLroVRR^ 

GFLLGDSTACNEADEKLELGDTVVLGSQNEACTVLFMLLYFFTMAGTVWWVILTITWFLA 
AGEKWSCEAIEQKAVWFHAVAWGTPGFLTVMLLALNKVEGDNISGVCFVGLYDLDASRY 

FVLIJPLCLCWVGLSLLLAGnSLNEW^ 
GCYVYEQVNRITWEITWVSDHCRQ 

WWGSKXTCTEWAGFFKRNRKRDPISESPJtVLQESCEFFL 

LKVISKSMGTSTGATA^GTSAVAn-SHDYLGQETLTEIQTSPETSMPJEVKADGASTPRLRE 
QDCGEPASPAASISRLSGEQVDGKGQAGSVSESARSEGRISPKSDITDTGLAQSNNLQVPSSS 
EPSSLKGSTSLLVHPVSGVRKEQGGGCHSDT 



Figure 69 

CTCTCCCAACCGCCTCGTCGCACTCCTGAGGCTGAGAGCACCGCTGCACTCGCGGCCGG 

CGATGCGGGACCCCGGCGCGGCCGCTCCGCTTTCGTCCCTGGGCCTCTGTGCCCTGGTG 

CTGGCGCTGCTGGGCGCACTGTCCGCGGGCGCCGGGGCGCAGCCGTACCACGGAGAGA 

AGGGCATCTCCGTGCCGGACCACGGCTTCTGCCAGCCCATCTCCATCCCGCTGTGCACG 

GACATCGCCTACAACCAGACCATCCTGCCCAACCTGCTGGGCCACACGAACCAAGAGG 

ACGCGGGCCTCGAGGTGCACCAGTTCTACCCGCTGGTGAAGGTGCAGTGTTCTCCCGA 

ACTCCGCTTTTTCTTATGCTCCATGTATGCGCCCGTGTGCACCGTGCTCGATCAGGCCAT 

CCCGCCGTGTCGTTCTCTGTGCGAGCGCGCCCGCCAGGGCTGCGAGGCGCTCATGAAC 

AAGTTCGGCTTCCAGTGGCCCGAGCGGCTGCGCTGCGAGAACTTCCCGGTGCACGGTG 

CGGGCGAGATCTGCGTGGGCCAGAACACGTCGGACGGCTCCGGGGGCCCAGGCGGCG 

GCCCCACTGCCTACCCTACCGCGCCCTACCTGCCGGACCTGCCCTTCACCGCGCTGCCC 

CCGGGGGCCTCAGATGGCAGGGGGCGTCCCGCCTTCCCCTTCTCATGCCCCCGTCAGCT 
CAAGGTGCCCCCGTACCTGGGCTACCGCTTCCTGGGTGAGCGCGATTGTGGCGCCCCGT

GCGAACCGGGCCGTGCCAACGGCCTGATGTACTTTAAGGAGGAGGAGAGGCGCTTCGC 

CCGCCTCTGGGTGGGCGTGTGGTCCGTGCTGTGCTGCGCCTCGACGCTCTTTACCGTTC 

TCACCTACCTGGTGGACATGCGGCGCTTCAGCTACCCAGAGCGGCCCATCATCTTCCTG 

TCGGGCTGCTACTTCATGGTGGCCGTGGCGCACGTGGCCGGCTTCCTTCTAGAGGACCG 

CGCCGTGTGCGTGGAGCGCTTCTCGGACGATGGCTACCGCACGGTGGCGCAGGGCACC 

AAGAAGGAGGGCTGCACCATCCTCTTCATGGTGCTCTACTTCTTCGGCATGGCCAGCTC 

CATCTGGTGGGTCATTCTGTCTCTCACTTGGTTCCTGGCGGCCGGCATGAAGTGGGGCC 

ACGAGGCCATCGAGGCCAACTCGCAGTACTTCCACCTGGCCGCGTGGGCCGTGCCCGC 

CGXCAAGACCATCACTATCCTGGCCATGGGCCAGGTAGACGGGGACCTGCTGAGCGGG 

GTGTGCTACGTTGGCCTCTCCAGTGTGGACGCGCTGCGGGGCTTCGTGCTGGCGCCTCT 

GTTCGTCTACCTCTTCATAGGCACGTCCTTCTTGCTGGCCGGCTTCGTGTCCCTCTTCCG 

TATCCGCACCATCATGAAACACGACGGCACCAAGACCGAGAAGCTGGAGAAGCTCAT 

GGTGCGCATCGGCGTCTTCAGCGTGCTCTACACAGTGCCCGCCACCATCGTCCTGGCCT 

GCTACTTCTACGAGCAGGCCTTCCGCGAGCACTGGGAGCGCACCTGGCTCCTGCAGAC 

GTGCAAGAGCTATGCCGTGCCCTGCCCGCCCGGCCACTTCCCGCCCATGAGCCCCGACT 

TCACCGTCTTCATGATCAAGTACCTGATGACCATGATCGTCGGCATCACCACTGGCTTC 

TGGATCTGGTCGGGCAAGACCCTGCAGTCGTGGCGCCGCTTCTACCACAGACTTAGCC 

ACAGCAGCAAGGGGGAGACTGCGGTATGAGCCCCGGCCCCTCCCCACCTTTCCCACCC 

CAGCCCTCTTGCAAGAGGAGAGGCACGGTAGGGAAAAGAACTGCTGGGTGGGGGCCT 

GTTTCTGTAACTTTCTCCCCCTCTACTGAGAAGTGACCTGGAAGTGAGAAGTTCTTTGC 

AGATTTGGGGCGAGGGGTGATTTGGAAAAGAAGACCTGGGTGGAAAGCGGTTTGGAT 

GAAAAGATTTCAGGCAAAGACTTGCAGGAAGATGATGATAACGGCGATGTGAATCGTC 

AAAGGTACGGGCCAGCTTGTGCCTAATAGAAGGTTGAGACCAGCAGAGACTGCTGTGA 

GTTTCTCCCGGCTCGGAGGCTGAACGGGGACTGTGAGCGATCCCCCTGCTGCAGGGCG 

AGTGGCCTGTCCAGACCCCTGTGAGGCCCCGGGAAAGGTACAGCCCTGTCTGCGGTGG 

CTGCTTTGTTGGAAAGAGGGAGGGCCTCCTGCGGTGTGCTTGTCAAGCAGTGGTCAAA 

CCATAATCTCTTTTCACTGGGGCCAAACTGGAGCCCAGATGGGTTAATTTCCAGGGTCA 

GACATTACGGTCTCTCCTCCCCTGCCCCCTCCCGCCTGTTTTTCCTCCCGTACTGCTTTC 

AGGTCTTGTAAAATAAGCATTTGGAAGTCTTGGGAGGCCTGCCTGCTAGAATCCTAATG 

TGAGGATGCAAAAGAAATGATGATAACATTTTGAGATAAGGCCAAGGAGACGTGGAG 

TAGGTATTTTTGCTACTTTTTCATTTTCTGGGGAAGGCAGGAGGCAGAAAGACGGGTGT 

TTTATTTGGTCTAATACCCTGAAAAGAAGTGATGACTTGTTGCTTTTCAAAACAGGAAT 

GCATTTTTCCCCTTGTCTTTGTTGTAAGAGACAAAAGAGGAAACAAAAGTGTCTCCCTG 

TGGAAAGGCATAACTGTGACGAAAGCAACTTTTATAGGCAAAGCAGCGCAAATCTGAG 

GTTTCCCGTTGGTTGTTAATTTGGTTGAGATAAACATTCCTTTTTAAGGAAAAGTGAAG 

AGCAGTGTGCTGTCACACACCGTTAAGCCAGAGGTTCTGACTTCGCTAAAGGAAATGT 

AAGAGGTTTTGTTGTCTGTTTTAAATAAATTTAATTCGGAACACATGATCCAACAGACT 

ATGTTAAAATATTCAGGGAAATCTCTCCCTTCATTTACTTTTTCTTGCTATAAGCCTATA 

TTTAGGTTTCTTTTCTATTTTTTTCTCCCATTTGGATCCTTTGAGGTAAAAAAACATAAT 

GTCTTCAGCCTCATAATAAAGGAAAGTTAATTAAAAAAAAAAAGCAAAGAGCCATTTT 

GTCCTGTTTTCTTGGTTCCATCAATCTGTTTATTAAACATCATCCATATGCTGACCCTGT 

CTCTGTGTGGTTGGGTTGGGAGGCGATCAGCAGATACCATAGTGAACGAAGAGGAAGG 

TTTGAACCATGGGCCCCATCTTTAAAGAAAGTCATTAAAAGAAGGTAAACTTCAAAGT 

GATTCTGGAGTTCTTTGAAATGTGCTGGAAGACTTAAATTTATTAATCTTAAATCATGT 

ACTTTTTTTCTGTAATAGAACTCGGATTCTTTTGCATGATGGGGTAAAGCTTAGCAGAG 

AATCATGGGAGCTAACCTTTATCCCACCTTTGACACTACCCTCCAATCTTGCAACACTA 

TCCTGTTTCTCAGAACAGTTTTTAAATGCCAATCATAGAGGGTACTGTAAAGTGTACAA 

GTTACTTTATATATGTAATGTTCACTTGAGTGGAACTGCTTTTTACATTAAAGTTAAAAT 

CGATCTTGTGTTTCTTCAACCTTCAAAACTATCTCATCTGTCAGATTTTTAAAACTCCAA 

CACAGGTTTTGGCATCTTTTGTGCTGTATCTTTTAAGTGCATGTGAAATTTGTAAAATAG 

AGATAAGTACAGTATGTATATTTTGTAAATCTCCCATTTTTGTAAGAAAATATATATTG 
TATTTATACATTTTTACTTTGGATTTTTGTTTTGTTGGCTTTAAAGGTCTACCCCACTTTA
TCACATGTACAGATCACAAATAAATTTTTTTAAATAC 

Figure 70 

IM^PGAAAPLSSLGLCALVLALLGALSAGAGAQPYHGEKGISVPDHGFCQPISIPLCTDIAY 

NQTILPNLLGHTNQEDAGLEVHQFYPLVKVQCSPELRFFLCSMYAPVCTVLDQAIPPCRSLC 

ERARQGCEALMNKFGFQWPEPvLRCENFPVHGAGEICVGQNTSDGSGGPGGGPTAYPTAPY 

LPDLPFTALPPGASDGRGRPAFPFSCPRQLKVPPYLGYRFLGERDCGAPCEPGRANGLMYF 

K^EERRFARLWGWSVLCCASTLFTVLTYEVDMRRFSYPERPIIFLSGCYFMVAVAHVA 

GFLLEDRAVCVERFSDDGYRTVAQGTKKEGCTILFMVLYFFGMASSIWWVILSLTWFLAA 

GMKWGHEAIEANSQYFHLAAWAVPAVKTITILAMGQVDGDLLSGVCYVGLSSVDA 

LRGFVLAPLFVYLFIGTSFLLAGFVSLFRIRTIMKTiDGTKTEKLEKLMVRIGVFSVLYTVPAT 

IVLACYFYEQAFREHWERTWLLQTCKSYAWCPPGHFPPMSPDFTWMIKYLMTMrVGITT 

GFWIWSGKTLQSWRRFYHRLSHSSKGETAV

Figure 71

ACAGCATGGAGTGGGGTTACCTGTTGGAAGTGACCTCGCTGCTGGCCGCCTTGGCGCT 

GCTGCAGCGCTCTAGCGGCGCTGCGGCCGCCTCGGCCAAGGAGCTGGCATGCCAAGAG 

ATCACCGTGCCGCTGTGTAAGGGCATCGGCTACAACTACACCTACATGCCCAATCAGTT 

CAACCACGACACGCAAGACGAGGCGGGCCTGGAGGTGCACCAGTTCTGGCCGCTGGTG 

GAGATCCAGTGCTCGCCCGATCTCAAGTTCTTCCTGTGCAGCATGTACACGCCCATCTG 

CCTAGAGGACTACAAGAAGCCGCTGCCGCCCTGCCGCTCGGTGTGCGAGCGCGCCAAG 

GCCGGCTGCGCGCCGCTCATGCGCCAGTACGGCTTCGCCTGGCCCGACCGCATGCGCT 

GCGACCGGCTGCCCGAGCAAGGCAACCCTGACACGCTGTGCATGGACTACAACCGCAC 

CGACCTAACCACCGCCGCGCCCAGCCCGCCGCGCCGCCTGCCGCCGCCGCCGCCCGGC 

GAGCAGCCGCCTTCGGGCAGCGGCCACGGCCGCCCGCCGGGGGCCAGGCCCCCGCACC 

GCGGAGGCGGCAGGGGCGGTGGCGGCGGGGACGCGGCGGCGCCCCCAGCTCGCGGCG 

GCGGCGGTGGCGGGAAGGCGCGGCCCCCTGGCGGCGGCGCGGCTCCCTGCGAGCCCG 

GGTGCCAGTGCCGCGCGCCTATGGTGAGCGTGTCCAGCGAGCGCCACCCGCTCTACAA 

CCGCGTCAAGACAGGCCAGATCGCTAACTGCGCGCTGCCCTGCCACAACCCCTTTTTCA 

GCCAGGACGAGCGCGCCTTCACCGTCTTCTGGATCGGCCTGTGGTCGGTGCTCTGCTTC 

GTGTCCACCTTCGCCACCGTCTCCACCTTCCTTATCGACATGGAGCGCTTCAAGTACCC 

GGAGCGGCCCATTATCTTCCTCTCGGCCTGCTACCTCTTCGTGTCGGTGGGCTACCTAG 

TGCGCCTGGTGGCGGGCCACGAGAAGGTGGCGTGCAGCGGTGGCGCGCCGGGCGCGG 

GGGGCGCTGGGGGCGCGGGCGGCGCGGCGGCGGGCGCGGGCGCGGCGGGCGCGGGCG 

CGGGCGGCCCGGGCGGGCGCGGCGAGTACGAGGAGCTGGGCGCGGTGGAGCAGCACG 

TGCGCTACGAGACCACCGGCCCCGCGCTGTGCACCGTGGTCTTCTTGCTGGTCTACTTC 

TTCGGCATGGCCAGCTCCATCTGGTGGGTGATCTTGTCGCTCACATGGTTCCTGGCGGC 

CGGTATGAAGTGGGGCAACGAAGCCATCGCCGGCTACTCGCAGTACTTCCACCTGGCC 

GCGTGGCTTGTGCCCAGCGTCAAGTCCATCGCGGTGCTGGCGCTCAGCTCGGTGGACG 

GCGACCCGGTGGCGGGCATCTGCTACGTGGGCAACCAGAGCCTGGACAACCTGCGCGG 

CTTCGTGCTGGCGCCGCTGGTCATCTACCTCTTCATCGGCACCATGTTCCTGCTGGCCG 

GCTTCGTGTCCCTGTTCCGCATCCGCTCGGTCATCAAGCAACAGGACGGCCCCACCAAG 

ACGCACAAGCTGGAGAAGCTGATGATCCGCCTGGGCCTGTTCACCGTGCTCTACACCG 

TGCCCGCCGCGGTGGTGGTCGCCTGCCTCTTCTACGAGCAGCACAACCGCCCGCGCTG 

GGAGGCCACGCACAACTGCCCGTGCCTGCGGGACCTGCAGCCCGACCAGGCACGCAG 

GCCCGACTACGCCGTCTTCATGCTCAAGTACTTCATGTGCCTAGTGGTGGGCATCACCT 

CGGGCGTGTGGGTCTGGTCCGGCAAGACGCTGGAGTCCTGGCGCTCCCTGTGCACCCG 

CTGCTGCTGGGCCAGCAAGGGCGCCGCGGTGGGCGGGGGCGCGGGCGCCACGGCCGC 

GGGGGGTGGCGGCGGGCCGGGGGGCGGCGGCGGCGGGGGACCCGGCGGCGGCGGGG 
GGCCGGGCGGCGGCGGGGGCTCCCTCTACAGCGACGTCAGCACTGGCCTGACGTGGCG 

GTCGGGCACGGCGAGCTCCGTGTCTTATCCAAAGCAGATGCCATTGTCCCAGGTCTGA 

GCGGAGGGGAGGGGGCGCCCAGGAGGGGTGGGGAGGGGGGCGAGGAGACCCAAGTG 

CAGCGAAGGGACACTTGATGGGCTGAGGTTCCCACCCCTTCACAGTGTTGATTGCTATT 

AGCATGATAATGAACTCTTAATGGTATCCATTAGCTGGGACTTAAATGACTCACTTAGA 

ACAAAGTACCTGGCATTGAAGCCTCCCAGACCCAGCCCCTTTTCCTCCATTGATGTGCG 

GGGAGCTCCTCCCGCCACGCGTTAATTTCTGTTGGCTGAGGAGGGTGGACTCTGCGGCG 

TTTCCAGAACCCGAGATTTGGAGCCCTCCCTGGCTGCACTTGGCTGGGTTTGCAGTCAG 

ATACACAGATTTCACCTGGGAGAACCTCTTTTTCTCCCTCGACTCTTCCTACGTAAACTC 

CCACCCCTGACTTACCCTGGAGGAGGGGTGACCGCCACCTGATGGGATTGCACGGTTT 

GGGTATTCTTAATGACCAGGCAAATGCCTTAAGTAAACAAACAAGAAATGTCTTAATT 

ATACACCCCACGTAAATACGGGTTTCTTACATTAGAGGATGTATTTATATAATTATTTG 

TTAAATTGTAAAAAAAAAAAGTGTAAAATATGTATATATCCAAAGATATAGTGTGTAC 

ATTTTTTTGTAAAAAGTTTAGAGGCTTACCCCTGTAAGAACAGATATAAGTATTCTATT 

TTGTCAATAAAATGACTTTTGATAAATGATTTAACCATTGCCCTCTCCCCCGCCTCTTCT 

GAGCTGTCACCTTTAAAGTGCTTGCTAAGGACGCATGGGGAAAATGGACATTTTCTGG 

CTTGTCATTCTGTACACTGACCTTAGGCATGGAGAAAATTACTTGTTAAACTCTAGTTC 

TTAAGTTGTTAGCCAAGTAAATATCATTGTTGAACTGAAATCAAAATTGAGTTTTTGCA 

CCTTCCCCAAAGACGGTGTTTTTCATGGGAGCTCTTTTCTGATCCATGGATAACAACTC 

TCACTTTAGTGGATGTAAATGGAACTTCTGCAAGGCAGTAATTCCCCTTAGGCCTTGTT 

ATTTATCCTGCATGGTATCACTAAAGGTTTCAAAACCCTGAAAAAAAA 



Figure 72 

MEWGYLLEVTSLLAALALLQRSSGAAAASAECELACQEITVPLCKGIGYNYTYMPNQFNHD 

TQDEAGLEVHQFWPLVEIQCSPDLKFFLCSMYTPICLEDYKKPLPPCRSVCERAKAGCAPL 

MRQYGFAWPDRMRCDRLPEQGNPDTLCMDYNRTDLTTAAPSPPRRLPPPPPGEQPPSGSG 

HGRPPGARPPHRGGGRGGGGGDAAAPPARGGGGGGKARPPGGGAAPCEPGCQCRAPMVS 

VSSERHPLYNRVKTGQIANCALPCHNPFFSQDERAFTVFWIGLWSVLCFVSTFATVSTFLID 

MERFKYPERPIIFLSACYLFVSVGYLVRLVAGHEKVACSGGAPGAGGAGGAGGAAAGAG 

AAGAGAGGPGGRGEYEELGAVEQHVRYETTGPALCTVVFLLVYFFGMASSIWWVILSLT 

WFLAAGMKWGNEAIAGYSQYFHLAAWLVPSVKSIAVLALSSVDGDPVAGICYVGNQSLD 
NLRGFVLAPLVIYLFIGTMFLLAGFVSLFRIRSVIKQQDGPTKTHKLEKLMIRLGLFTVLYTV 
PAAVVVACLFYEQHNRPRWEATHNCPCLRDLQPDQARRPDYAVFMLKYFMCLVVGITSG 
VWVWSGKTLESWRSLCTRCCWASKGAAVGGGAGATAAGGGGGPGGGGGGGPGGGGGP 
GGGGGSLYSDVSTGLTWRSGTASSVSYPKQMPLSQV 

Figure 73 

CCGCCTTCGGCCCGGGCCTCCCGGGATGGCCGTGGCGCCTCTGCGGGGGGCGCTGCTG 
CTGTGGCAGCTGCTGGCGGCGGGCGGCGCGGCACTGGAGATCGGCCGCTTCGACCCGG 
AGCGCGGGCGCGGGGCTGCGCCGTGCCAGGCGGTGGAGATCCCCATGTGCCGCGGCAT 
CGGCTACAACCTGACCCGCATGCCCAACCTGCTGGGCCACACGTCGCAGGGCGAGGCG 
GCTGCCGAGCTAGCGGAGTTCGCGCCGCTGGTGCAGTACGGCTGCCACAGCCACCTGC 
GCTTCTTCCTGTGCTCGCTCTACGCGCCCATGTGCACCGACCAGGTCTCGACGCCCATT 
CCCGCCTGCCGGCCCATGTGCGAGCAGGCGCGCCTGCGCTGCGCGCCCATCATGGAGC 
AGTTCAACTTCGGCTGGCCGGACTCGCTCGACTGCGCCCGGCTGCCCACGCGCAACGA 
CCCGCACGCGCTGTGCATGGAGGCGCCCGAGAACGCCACGGCCGGCCCCGCGGAGCCC 
CACAAGGGCCTGGGCATGCTGCCCGTGGCGCCGCGGCCCGCGCGCCCTCCCGGAGACC 
TGGGCCCGGGCGCGGGCGGCAGTGGCACCTGCGAGAACCCCGAGAAGTTCCAGTACGT 

GGAGAAGAGCCGCTCGTGCGCACCGCGCTGCGGGCCCGGCGTCGAGGTGTTCTGGTCC 

CGGCGCGACAAGGACTTCGCGCTGGTCTGGATGGCCGTGTGGTCGGCGCTGTGCTTCTT 

CTCCACCGCCTTCACTGTGCTCACCTTCTTGCTGGAGCCCCACCGCTTCCAGTACCCCG 

AGCGCCCCATCATCTTCCTCTCCATGTGCTACAACGTCTACTCGCTGGCCTTCCTGATCC 

GTGCGGTGGCCGGAGCGCAGAGCGTGGCCTGTGACCAGGAGGCGGGCGCGCTCTACGT 

GATCCAGGAGGGCCTGGAGAACACGGGCTGCACGCTGGTCTTCCTACTGCTCTACTAC 

TTCGGCATGGCCAGCTCGCTCTGGTGGGTGGTCCTGACGCTCACCTGGTTCCTGGCTGC 

CGGGAAGAAATGGGGCCACGAGGCCATCGAGGCCCACGGCAGCTATTTCCACATGGCT 

GCCTGGGGCCTGCCCGCGCTCAAGACCATCGTCATCCTGACCCTGCGCAAGGTGGCGG 

GTGATGAGCTGACTGGGCTTTGCTACGTGGCCAGCACGGATGCAGCAGCGCTCACGGG 

CTTCGTGCTGGTGCCCCTCTCTGGCTACCTGGTGCTGGGCAGTAGTTTCCTCCTGACCG 

GCTTCGTGGCCCTCTTCCACATCCGCAAGATCATGAAGACGGGCGGCACCAACACAGA 

GAAGCTGGAGAAGCTCATGGTCAAGATCGGGGTCTTCTCCATCCTCTACACGGTGCCC 

GCCACCTGCGTCATCGTTTGCTATGTCTACGAACGCCTCAACATGGACTTCTGGCGCCT 

TCGGGCCACAGAGCAGCCATGCGCAGCGGCCGCGGGGCCCGGAGGCCGGAGGGACTG 

CTCGCTGCCAGGGGGCTCGGTGCCCACCGTGGCGGTCTTCATGCTCAAAATTTTCATGT 

CACTGGTGGTGGGGATCACCAGCGGCGTCTGGGTGTGGAGCTCCAAGACTTTCCAGAC 

CTGGCAGAGCCTGTGCTACCGCAAGATAGCAGCTGGCCGGGCCCGGGCCAAGGCCTGC 

CGCGCCCCCGGGAGCTACGGACGTGGCACGCACTGCCACTATAAGGCTCCCACCGTGG 

TCTTGCACATGACTAAGACGGACCCCTCTTTGGAGAACCCCACACACCTCTAGCCACAC 

AGGCCTGGCGCGGGGTGGCTGCTGCCCCCTCCTTGCCCTCCACGCCCTGCCCCCTGCAT 

CCCCTAGAGACAGCTGACTAGCAGCTGCCCAGCTGTCAAGGTCAGGCAAGTGAGCACC 

GGGGACTGAGGATCAGGGCGGGACCCCGTGAGGCTCATTAGGGGAGATGGGGGTCTC 

CCCTAATGCGGGGGCTGGACCAGGCTGAGTCCCCACAGGGTCCTAGTGGAGGATGTGG 

AGGGGCGGGGCAGAGGGGTCCAGCCGGAGTTTATTTAATGATGTAATTTATTGTTGCG 

TTCCTCTGGAAGCTGTGACTGGAATAAACCCCCGCGTGGCACTGCTGATCCTCTCTGGC 

TGGGAAGGGGGAAGGTAGGAGGTGAGGC 

Figure 74 

MAVAPLRGALLLWQLLAAGGAALEIGRFDPERGRGAAPCQAVEIPMCRGIGYNLTRMPNL 

LGHTSQGEAAAELAEFAPLVQYGCHSHLRFFLCSLYAPMCTDQVSTPIPACRPMCEQARLR 

CAPIMEQFNFGWPDSLDCARLPTRNDPHALCMEAPENATAGPAEPHKGLGMLPVAPRPAR 

PPGDLGPGAGGSGTCENPEKFQYVEKSRSCAPRCGPGVEVFWSRRDKDFALVWMAVWSA 

LCFFSTAFTVLTFLLEPHRFQYPERPIIFLSMCYNVYSLAFLIRAVAGAQSVACDQEAGALY 

VIQEGLENTGCTLVFLLLYYFGMASSLWWVVLTLTWFLAAGKKWGHEAIEAHGSYFHMA 

AWGLPALKTIVILTLRKVAGDELTGLCYVASTDAAALTGFVLVPLSGYLVLGSSFLLTG 

FVALFHIRKIMKTGGTNTEKLEKLMVKIGVFSILYTVPATCVIVCYVYERLNMDFWRLRAT 

EQPCAAAAGPGGRRDCSLPGGSVPTVAVFMLKIFMSLVVGITSGVWVWSSKTFQTWQSLC 
YRKIAAGRARAKACRAPGSYGRGTHCHYKAPTVVLHMTKTDPSLENPTHL 



Figure 75 

ACACGTCCAACGCCAGCATGCAGCGCCCGGGCCCCCGCCTGTGGCTGGTCCTGCAGGT 

GATGGGCTCGTGCGCCGCCATCAGCTCCATGGACATGGAGCGCCCGGGCGACGGCAAA 

TGCCAGCCCATCGAGATCCCGATGTGCAAGGACATCGGCTACAACATGACTCGTATGC 

CCAACCTGATGGGCCACGAGAACCAGCGCGAGGCAGCCATCCAGTTGCACGAGTTCGC 

GCCGCTGGTGGAGTACGGCTGCCACGGCCACCTCCGCTTCTTCCTGTGCTCGCTGTACG 

CGCCGATGTGCACCGAGCAGGTCTCTACCCCCATCCCCGCCTGCCGGGTCATGTGCGA 

GCAGGCCCGGCTCAAGTGCTCCCCGATTATGGAGCAGTTCAACTTCAAGTGGCCCGAC 
TCCCTGGACTGCCGGAAACTCCCCAACAAGAACGACCCCAACTACCTGTGCATGGAGG 

CGCCCAACAACGGCTCGGACGAGCCCACCCGGGGCTCGGGCCTGTTCCCGCCGCTGTT 

CCGGCCGCAGCGGCCCCACAGCGCGCAGGAGCACCCGCTGAAGGACGGGGGCCCCGG 

GCGCGGCGGCTGCGACAACCCGGGCAAGTTCCACCACGTGGAGAAGAGCGCGTCGTG 

CGCGCCGCTCTGCACGCCCGGCGTGGACGTGTACTGGAGCCGCGAGGACAAGCGCTTC 

GCAGTGGTCTGGCTGGCCATCTGGGCGGTGCTGTGCTTCTTCTCCAGCGCCTTCACCGT 

GCTCACCTTCCTCATCGACCCGGCCCGCTTCCGCTACCCCGAGCGCCCCATCATCTTCC 

TCTCCATGTGCTACTGCGTCTACTCCGTGGGCTACCTCATCCGCCTCTTCGCCGGCGCC 

GAGAGCATCGCCTGCGACCGGGACAGCGGCCAGCTCTATGTCATCCAGGAGGGACTGG 

AGAGCACCGGCTGCACGCTGGTCTTCCTGGTCCTCTACTACTTCGGCATGGCCAGCTCG 

CTGTGGTGGGTGGTCCTCACGCTCACCTGGTTCCTGGCCGCCGGCAAGAAGTGGGGCC 

ACGAGGCCATCGAAGCCAACAGCAGCTACTTCCACCTGGCAGCCTGGGCCATCCCGGC 

GGTGAAGACCATCCTGATCCTGGTCATGCGCAGGGTGGCGGGGGACGAGCTCACCGGG 

GTCTGCTACGTGGGCAGCATGGACGTCAACGCGCTCACCGGCTTCGTGCTCATTCCCCT 

GGCCTGCTACCTGGTCATCGGCACGTCCTTCATCCTCTCGGGCTTCGTGGCCCTGTTCC 

ACATCCGGAGGGTGATGAAGACGGGCGGCGAGAACACGGACAAGCTGGAGAAGCTCA 

TGGTGCGTATCGGGCTCTTCTCTGTGCTGTACACCGTGCCGGCCACCTGTGTGATCGCC 

TGCTACTTTTACGAACGCCTCAACATGGATTACTGGAAGATCCTGGCGGCGCAGCACA 

AGTGCAAAATGAACAACCAGACTAAAACGCTGGACTGCCTGATGGCCGCCTCCATCCC 

CGCCGTGGAGATCTTCATGGTGAAGATCTTTATGCTGCTGGTGGTGGGGATCACCAGGG 

GGATGTGGATTTGGACCTCCAAGACTCTGCAGTCCTGGCAGCAGGTGTGCAGCCGTAG 

GTTAAAGAAGAAGAGCCGGAGAAAACCGGCCAGCGTGATCACCAGCGGTGGGATTTA 

CAAAAAAGCCCAGCATCCCCAGAAAACTCACCACGGGAAATATGAGATCCCTGCCCAG 

TCGCCCACCTGCGTGTGAACAGGGCTGGAGGGAAGGGCACAGGGGCGCCCGGAGCTA 

AGATGTGGTGCTTTTCTTGGTTGTGTTTTTCTTTCTTCTTCTTCTTTTTTTTTTTTTTATAA 

AAGCAAAAGAGAAATACATAAAAAAGTGTTTACCCTGAAATTCAGGATGCTGTGATAC 

ACTGAAAGGAAAAATGTACTTAAAGGGTTTTGTTTTGTTTTGGTTTTCCAGCGAAGGGA 

AGCTCCTCCAGTGAAGTAGCCTCTTGTGTAACTAATTTGTGGTAAAGTAGTTGATTCAG 

CCCTCAGAAGAAAACTTTTGTTTAGAGCCCTCCGTAAATATACATCTGTGTATTTGAGT 

TGGCTTTGCTACCCATTTACAAATAAGAGGACAGATAACTGCTTTGCAAATTCAAGAGC 

CTCCCCTGGGTTAACAAATGAGCCATCCCCAGGGCCCACCCCCAGGAAGGCCACAGTG 

CTGGGCGGCATCCCTGCAGAGGAAAGACAGGACCCGGGGCCCGCCTCACACCCCAGTG 

GATTTGGAGTTGCTTAAAATAGACTCTGGCCTTCACCAATAGTCTCTCTGCAAGACAGA 

AACCTCCATCAAACCTCACATTTGTGAACTCAAACGATGTGCAATACATTTTTTTCTCTT 

TCCTTGAAAATAAAAAGAGAAACAAGTATTTTGCTATATATAAAGACAACAAAAGAAA 

TCTCCTAACAAAAGAACTAAGAGGCCCAGCCCTCAGAAACCCTTCAGTGCTACATTTT 

GTGGCTTTTTAATGGAAACCAAGCCAATGTTATAGACGTTTGGACTGATTTGTGGAAAG 

GAGGGGGGAAGAGGGAGAAGGATCATTCAAAAGTTACCCAAAGGGCTTATTGACTCTT 

TCTATTGTTAAACAAATGATTTCCACAAACAGATCAGGAAGCACTAGGTTGGCAGAGA 

CACTTTGTCTAGTGTATTCTCTTCACAGTGCCAGGAAAGAGTGGTTTCTGCGTGTGTAT 

ATTTGTAATATATGATATTTTTCATGCTCCACTATTTTATTAAAAATAAAATATGTTCTT 

TAAAAAAA 

Figure 76 

MQRPGPRLWLVLQVMGSCAAISSMDMERPGDGK^ 

NQREAAIQLHEFAPLVEYGCHGHLRFFLCSLYAPMCTEQVSTPIPACRVMCEQARLKCSPI 
MEQFNFKWPDSLDCRKLPNKDSIDPNYLC1VIEAP1SING 

PLKDGGPGRGGCDNPGKFHHVEKSASCAPLCTPGVDVYWSREDKRFAWWLAIWAVLCF 
FSSAFTVLTFLIDPAP^RyPERPnFLSMCYCWSVGYLIPXFAGAESIACDRDSGQLYVIQEG 
LESTGCTLVFLVLYYFGMAS SLWWVVLTLTWFLAAGE3CWGHEAIEAISrS S YFHLAAWAIP 
AVKTILILVMRRVAGDELTGVCYVGSMDVNALTGFVLIPLACYLVIGTSFILSGFVAL 



SUBSTITUTE SHEET (RULE 26) 



WO 02/077204 



54/66 



PCT/GB02/01195 



FHIRRVMKTGGENTDKLEKLMWTGLFSVLYTWATCVIACYFYERLNMDYWKILAAQHK 
CKMNNQTKTLDCLMAASIPAVEIFMVEnFMLLWGITSGMWTWTSKTLQSWQQVCSRRLK 
KKSRRKP AS V]TS GGIYKKAQHPQKTHHGKYEIPAQSPTCV 



Figure 77 

CCTGCAGCCTCCGGAGTCAGTGCCGCGCGCCCGCCGCCCCGCGCCTTCCTGCTCGCCGC 

ACCTCCGGGAGCCGGGGCGCACCCAGCCCGCAGCGCCGCCTCCCCGCCCGCGCCGCCT 

CCGACCGCAGGCCGAGGGCCGCCACTGGCCGGGGGGACCGGGCAGCAGCTTGCGGCC 

GCGGAGCCGGGCAACGCTGGGGACTGCGCCTTTTGTCCCCGGAGGTCCCTGGAAGTTT 

GCGGCAGGACGCGCGCGGGGAGGCGGCGGAGGCAGCCCCGACGTCGCGGAGAACAGG 

GCGCAGAGCCGGCATGGGCATCGGGCGCAGCGAGGGGGGCCGCCGCGGGGCCCTGGG 

CGTGCTGCTGGCGCTGGGCGCGGCGCTTCTGGCCGTGGGCTCGGCCAGCGAGTACGAC 

TACGTGAGCTTCCAGTCGGACATCGGCCCGTACCAGAGCGGGCGCTTCTACACCAAGC 

CACCTCAGTGCGTGGACATCCCCGCGGACCTGCGGCTGTGCCACAACGTGGGCTACAA 

GAAGATGGTGCTGCCCAACCTGCTGGAGCACGAGACCATGGCGGAGGTGAAGCAGCA 

GGCCAGCAGCTGGGTGCCCCTGCTCAACAAGAACTGCCACGCCGGGACCCAGGTCTTC 

CTCTGCTCGCTCTTCGCGCCCGTCTGCCTGGACCGGCCCATCTACCCGTGTCGCTGGCT 

CTGCGAGGCCGTGCGCGACTCGTGCGAGCCGGTCATGCAGTTCTTCGGCTTCTACTGGC 

CCGAGATGCTTAAGTGTGACAAGTTCCCGGAGGGGGACGTCTGCATCGCCATGACGCC 

GCCCAATGCCACCGAAGCCTCCAAGCCCCAAGGCACAACGGTGTGTCCTCCCTGTGAC 

AACGAGTTGAAATCTGAGGCCATCATTGAACATCTCTGTGCCAGCGAGTTTGCACTGA 

GGATGAAAATAAAAGAAGTGAAAAAAGAAAATGGCGACAAGAAGATTGTCCCCAAGA 

AGAAGAAGCCCCTGAAGTTGGGGCCCATCAAGAAGAAGGACCTGAAGAAGCTTGTGC 

TGTACCTGAAGAATGGGGCTGACTGTCCCTGCCACCAGCTGGACAACCTCAGCCACCA 

CTTCCTCATCATGGGCCGCAAGGTGAAGAGCCAGTACTTGCTGACGGCCATCCACAAG 

TGGGACAAGAAAAACAAGGAGTTCAAAAACTTCATGAAGAAAATGAAAAACCATGAG 

TGCCCCACCTTTCAGTCCGTGTTTAAGTGATTCTCCCGGGGGCAGGGTGGGGAGGGAG 

CCTCGGGTGGGGTGGGAGCGGGGGGGACAGTGCCCGGGAACCCGTGGTCACACACAC 

GCACTGCCCTGTCAGTAGTGGACATTGTAATCCAGTCGGCTTGTTCTTGCAGCATTCCC 

GCTCCCTTTCCCTCCATAGCCACGCTCCAAACCCCAGGGTAGCCATGGCCGGGTAAAG 

CAAGGGCCATTTAGATTAGGAAGGTTTTTAAGATCCGCAATGTGGAGCAGCAGCCACT 

GCACAGGAGGAGGTGACAAACCATTTCCAACAGCAACACAGCCACTAAAACACAAAA 

AGGGGGATTGGGCGGAAAGTGAGAGCCAGCAGCAAAAACTACATTTTGCAACTTGTTG 

GTGTGGATCTATTGGCTGATCTATGCCTTTCAACTAGAAAATTCTAATGATTGGCAAGT 

CACGTTGTTTTCAGGTCCAGAGTAGTTTCTTTCTGTCTGCTTTAAATGGAAACAGACTC 

ATACCACACTTACAATTAAGGTCAAGCCCAGAAAGTGATAAGTGCAGGGAGGAAAAG 

TGCAAGTCCATTATCTAATAGTGACAGCAAAGGGACCAGGGGAGAGGCATTGCCTTCT 

CTGCCCACAGTCTTTCCGTGTGATTGTCTTTGAATCTGAATCAGCCAGTCTCAGATGCC 

CCAAAGTTTCGGTTCCTATGAGCCCGGGGCATGATCTGATCCCCAAGACATGTGGAGG 

GGCAGCCTGTGCCTGCCTTTGTGTCAGAAAAAGGAAACCACAGTGAGCCTGAGAGAGA 

CGGCGATTTTCGGGCTGAGAAGGCAGTAGTTTTCAAAACACATAGTTA 



Figure 78 

MGIGRSEGGRRGAALGVLLALGAALLAVGSASEYDYVSFQSDIGPYQSGRFYTKPPQCVDI 

PADLRLCHNVGYKKMVLPNLLEHETMAEVKQQASSWVPLLNKNCHAGTQVFLCSLFAPV 

CLDRPIYPCRWLCEAVRDSCEPVMQFFGFYWPEMLKCDKFPEGDVCIAMTPPNATEASKP 

QGTTVCPPCDNELKSEA3IEHLCASEFGLSLKMIVGSSHNSCCTLGPSHPNSSKRQEQELGTP 

ERRLGYGLLLHFIQGNLPPPCAQARSRMRLKTEATPLALGRSAPGLFADCPERPLPVCSFPH 
HTEEVGKXRIHSFLLQVKGFSMKGLCAPSTLRYLYYLKTSMQH^/HQEyQAHSAQVWANM 
PPAERCKDEEDKAMFSK 



Figure 79 

GAATTCGTTCAGCCTGGTTAAGTCCAAGCTGGCTCATTCTGCTCCCCCGGGTCGGAGCC 

ccccggagctgcgcgcgggcttgcagcgcctcgcccgcgctgtcctcccggtgtcccgc 
ttctccgcgccccagccgccggctgccagcttttcggggccccgAgtcgcacccagcga 
agagagcgggcccgggacaagctcgaactccggccgcctcgcccttaaccagctccgt 
ccctctaccccctaggggtcgcgcccacgatgctgcagggccctggctcgctgctgctg 

CTCTTCCTCGCCTCGCACTGCTGCCTGGGCTCGGCGCGCGGGCTCTTCCTCTTTGGCCA 

GCCCGACTTCTCCTACAAGCGCAGCAATTGCAAGCCCATCCCGGCCAACCTGCAGCTG 

TGCCACGGCATCGAATACCAGAACATGCGGCTGCCCAACCTGCTGGGCCACGAGACCA 

TGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCAGTGCCA 

CCCGGACACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCCTCGATGACCTAG 

ACGAGACCATCCAGCCATGCCACTCTCGNTGCGTGCAGGTGAAGGATCGCTGCGCCCC 

GGTCATGTCCGCCTTCCCCTGGCCCGACATGCTTGAGTGCGACCGTTTCCCCCAGGACA 

ACGACCTTTGCATCCCCCTCGCTAGCAGCGACCACCTCCTGCCAGCCACCGAGGAAGC 

TCCAAAGGTATGTGAAGCCTGCAAAAATAAAAATGATGATGACAACGACATAATGGA 

AACGCTTTGTAAAAATGATTTTGCACTGAAAATAAAAGTGAAGGAGATAACCTACATC 

AACCGT 



Figure 80 

MLQGPGSLLLLFLASHCCLGSARGLFLFGQPDFSYKRSNCKPIPANLQLCHGIEYQNMRLP 
NLLGHETMKEVLEQAGAWIPLVMKQCHPDTKKFLCSLFAPVCLDDLDETIQPCHSRCVQV 
KDRCAPVMSAFPWPDMLECDRFPQDNDLCIPLASSDHLLPATEEAPKVCEACKNKNDDDN 
DIMETLCKNDFALKIKVKEITYINR 

Figure 81 

CCGGGTCGGAGCCCCCCGGAGCTGCGCGCGGGCTTGCAGCGCCTCGCCCGCGCTGTCC 

TCCCGGTGTCCCGCTTCTCCGCGCCCCAGCCGCCGGCTGCCAGCTTTTCGGGGCCCCGA 

GTCGCACCCAGCGAAGAGAGCGGGCCCGGGACAAGCTCGAACTCCGGCCGCCTCGCCC 

TTCCCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTCGCGCGCCCACGATGCTGCAGGGCC 

CTGGCTCGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCTGGGCTCGGCGCGCGGG 

CTCTTCCTCTTTGGCCAGCCCGACTTCTCCTACAAGCGCAGCAATTGCAAGCCCATC 

CCTGCCAACCTGCAGCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCCAACC 

TGCTGGGCCACGAGACCATGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCT 

GGTCATGAAGCAGTGCCACCCGGACACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCC 

GTCTGCCTCGATGACCTAGACGAGACCATCCAGCCATGCCACTCGCTCTGCGTGCAGGT 

GAAGGACCGCTGCGCCCCGGTCATGTCCGCCTTCGGCTTCCCCTGGCCCGACATGCTTG 

AGTGCGACCGTTTCCCCCAGGACAACGACCTTTGCATCCCCCTCGCTAGCAGCGACCA 

CCTCCTGCCAGCCACCGAGGAAGCTCCAAAGGTATGTGAAGCCTGCAAAAATAAAAAT 

GATGATGACAACGACATAATGGAAACGCTTTGTAAAAATGATTTTGCACTGAAAATAA 

AAGTGAAGGAGATAACCTACATCAACCGAGATACCAAAATCATCCTGGAGACCAAGA 
GCAAGACCATTTACAAGCTGAACGGTGTGTCCGAAAGGGACCTGAAGAAATCGGTGCT 

GTGGCTCAAAGACAGCTTGCAGTGCACCTGTGAGGAGATGAACGACATCAACGCGCCC 

TATCTGGTCATGGGACAGAAACAGGGTGGGGAGCTGGTGATCACCTCGGTGAAGCGGT 

GGCAGAAGGGGCAGAGAGAGTTCAAGCGCATCTCCCGCAGCATCCGCAAGCTGCAGT 

GCTAGTCCCGGCATCCTGATGGCTCCGACAGGCCTGCTCCAGAGCACGGCTGACCATTT 

CTGCTCCGGGATCTCAGCTCCCGTTCCCCAAGCACACTCCTAGCTGCTCCAGTCTCAGC 

CTGGGCAGCTTCCCCCTGCCTTTTGCACGTTTGCATCCCCAGCATTTCCTGAGTTATAAG 

GCCACAGGAGTGGATAGCTGTTTTCACCTAAAGGAAAAGCCCACCCGA 

ATCTTGTAGAAATATTCAAACTAATAAAATCATGAATATTTTTATGAAGTTT 



Figure 82 

MLQGPGSLLLLFLASHCCLGSARGLFLFGQPDFSYKRSNCKPIPANLQLCHGIEYQNMRLP 

NLLGHETMKEVLEQAGAWIPLVMKQCHPDTKKFLCSLFAPVCLDDLDETIQPCHSLCVQV 

KDRCAPVMSAFGFPWPDMLECDRFPQDNDLCIPLASSDHLLPATEEAPKVCEACKNKNDD 

DNDIMETLCKNDFALKIKVKEITYINRDTKIILETKSKTIYKLNGVSERDLKKSVLWLKDSL 

QCTCEEMNDINAPYLVMGQKQGGELVITSVKRWQKGQREFKRISRSIRKLQC 



Figure 83 

ACGGGGCCTGGGCGGSAGGGGCGGTGGCTGGAGCTCGGTAAAGCTCGTGGGACCCCAT 

TGGGGGAATTTGATCCAAGGAAGCGGTGATTGCCGGGGGAGGAGAAGCTCCCAGATCC 

TTGTGTCCACTTGCAGCGGGGGAGGCGGAGACGCGGAGCGGGCCTTTTGGCGTCCACT 

GCGCGGCTGCACCCTGCCCCATCCTGCCGGGATCATGGTCTGCGGCAGCCCGGGAGGG 

ATGCTGCTGCTGCGGGCCGGGCTGCTTGCCCTGGCTGCTCTCTGCCTGCTCCGGGTGCC 

CGGGGCTCGGGCTGCAGCCTGTGAGCCCGTCCGCATCCCCCTGTGCAAGTCCCTGCCCT 

GGAACATGACTAAGATGCCCAACCACCTGCACCACAGCACTCAGGCCAACGCCATCCT 

GGCCATCGAGCAGTTCGAAGGTCTGCTGGGCACCCACTGCAGCCCCGATCTGCTCTTCT 

TCCTCTGTGCCATGTACGCGCCCATCTGCACCATTGACTTCCAGCACGAGCCCATCAAC 

CCCTGTAAGTCTGTGTGCGAGCGGGCCCGGCAGGGCTGTGAGCCCATACTCATCAAGT 

ACCGCCACTCGTGGCCGGAGAACCTGGCCTGCGAGGAGCTGCCAGTGTACGACAGGGG 

CGTGTGCATCTCTCCCGAGGCCATCGTTACTGCGGACGGAGCTGATTTTCCTATGGATT 

CTAGTAACGGAAACTGTAGAGGGGCAAGCAGTGAACGCTGTAAATGTAAGCCTATTAG 

AGCTACACAGAAGACCTATTTCCGGAACAATTACAACTATGTCATTCGGGCTAAAGTT 

AAAGAGATAAAGACTAAGTGCCATGATGTGACTGCAGTAGTGGAGGTGAAGGAGATT 

CTAAAGTCCTCTCTGGTAAACATTCCACGGGACACTGTCAACCTCTATACCAGCTCTGG 

CTGCCTCTGCCCTCCACTTAATGTTAATGAGGAATATATCATCATGGGCTATGAAGATG 

AGGAACGTTCCAGATTACTCTTGGTGGAAGGCTCTATAGCTGAGAAGTGGAAGGATCG 

ACTCGGTAAAAAAGTTAAGCGCTGGGATATGAAGCTTCGTCATCTTGGACTCAGTAAA 

AGTGATTCTAGCAATAGTGATTCCACTCAGAGTCAGAAGTCTGGCAGGAACTCGAACC 

CCCGGCAAGCACGCAACTAAATCCCGAAATACAAAAAGTAACACAGTGGACTTCCTAT 

TAAGACTTACTTGCATTGCTGGACTAGCAAAGGAAAATTGCACTATTGCACATCATATT 

CTATTGTTTACTATAAAAATCATGTGATAACTGATTATTACTTCTGTTTCTCTTTTGGTTT 

CTGCTTCTCTCTTCTCTCAACCCCTTTGTAATGGTTTGGGGGCAGACTCTTAAGTATATT 

GTGAGTTTTCTATTTCACTAATCATGAGAAAAACTGTTCTTTTGCAATAATAATAAATT 

AAACATGCTGTTA 

Figure 84 






MVCGSPGGMIXLRAGLLALAALCLL^ 

STQANAIIAmQFEGLLGTHCSPDLLFFLCAMYAPICTIDFQHEPIKPCKSVCERARQGCEPIL 

IKYPJHSWPENLACEELPVYDRGVCISPEAIVTADGADFPMDSSNGNCRGASSERCKCKPIR 

ATQKTYFRNNYNYVIRAKVEEIKTKCHDVTAVVEVKEILKSSLVNIPP^ 

PPLNVNGBEYIIMGYEDEERSRLLLV^ 

SDSTQSQKSGRNSNPRQARN 



Figure 85 

CAGCGGCCGCTGAATTCTAGGGCGGGTTCGCGCCCCGAAGGCTGAGAGCTGGCGCTGC 

TCGTGCCCTGTGTGCCAGACGGCGGAGCTCCGCGGCCGGACCCCGCGGCCCCGCTTTG 

CTGCCGACTGGAGTTTGGGGGAAGAAACTCTCCTGCGCCCCAGAAGATTTCTTCCTCGG 

CGAAGGGACAGCGAAAGATGAGGGTGGCAGGAAGAGAAGGCGCTTTCTGTCTGCCGG 

GGTCGCAGCGCGAGAGGGCAGTGCCATGTTCCTCTCCATCCTAGTGGCGCTGTGCCTGT 

GGCTGCACCTGGCGCTGGGCGTGCGCGGCGCGCCCTGCGAGGCGGTGCGCATCCCTAT 

GTGCCGGCACATGCCCTGGAACATCACGCGGATGCCCAACCACCTGCACCACAGCACG 

CAGGAGAACGCCATCCTGGCCATCGAGCAGTACGAGGAGCTGGTGGACGTGAACTGC 

AGCGCCGTGCTGCGCTTCTTCTTCTGTGCCATGTACGCGCCCATTTGCACCCTGGAGTT 

CCTGCACGACCCTATCAAGCCGTGCAAGTCGGTGTGCCAACGCGCGCGCGACGACTGC 

GAGCCCCTCATGAAGATGTACAACCACAGCTGGCCCGAAAGCCTGGCCTGCGACGAGC 

TGCCTGTCTATGACCGTGGCGTGTGCATTTCGCCTGAAGCCATCGTCACGGACCTCCCG 

GAGGATGTTAAGTGGATAGACATCACACCAGACATGATGGTACAGGAAAGGCCTCTTG 

ATGTTGACTGTAAACGCCTAAGCCCCGATCGGTGCAAGTGTAAAAAGGTGAAGCCAAC 

TTTGGCAACGTATCTCAGCAAAAACTACAGCTATGTTATTCATGCCAAAATAAAAGCTG 

TGCAGAGGAGTGGCTGCAATGAGGTCACAACGGTGGTGGATGTAAAAGAGATCTTCAA 

GTCCTCATCACCCATCCCTCGAACTCAAGTCCCGCTCATTACAAATTCTTCTTGCCAGT 

GTCCACACATCCTGCCCCATCAAGATGTTCTCATCATGTGTTACGAGTGGCGTTCAAGG 

ATGATGCTTCTTGAAAATTGCTTAGTTGAAAAATGGAGAGATCAGCTTAGTAAAAGAT 

CCATACAGTGGGAAGAGAGGCTGCAGGAACAGCGGAGAACAGTTCAGGACAAGAAGA 

AAACAGCCGGGCGCACCAGTCGTAGTAATCCCCCCAAACCAAAGGGAAAGCCTCCTGC 

TCCCAAACCAGCCAGTCCCAAGAAGAACATTAAAACTAGGAGTGCCCAGAAGAGAAC 

AAACCCGAAAAGAGTGTGAGCTAACTAGTTTCCAAAGCGGAGACTTCCGACTTCCTTA 

CAGGATGAGGCTGGGCATTGCCTGGGACAGCCTATGTAAGGCCATGTGCCCCTTGCCC 

TAACAACTCACTGCAGTGCTCTTCATAGACACATCTTGCAGCATTTTTCTTAAGGCTAT 

GCTTCAGTTTTTCTTTGTAAGCCATCACAAGCCATAGTGGTAGGTTTGCCCTTTGGTACA 

GAAGGTGAGTTAAAGCTGGTGGAAAAGGCTTATTGCATTGCATTCAGAGTAACCTGTG 

TGCATACTCTAGAAGAGTAGGGAAAATAATGCTTGTTACAATTCGACCTAATATGTGC 

ATTGTAAAATAAATGCCATATTTCAAACAAAACACGTAATTTTTTTACAGTATGTTTTA 

TTACCTTTTGATATCTGTTGTTGCAATGTTAGTGATGTTTTAAAATGTGATGAAAATATA 

ATGTTTTTAAGAAGGAACAGTAGTGGAATGAATGTTAAAAGATCTTTATGTGTTTATGG 

TCTGCAGAAGGATTTTTGTGATGAAAGGGGATTTTTTGAAAAATTAGAGAAGTAGCAT 

ATGGAAAATTATAATGTGTTTTTTTACCAATGACTTCAGTTTCTGTTTTTAGCTAGAAAC 

TTAAAAACAAAAATAATAATAAAGAAAAATAAATAAAAAGGAGAGGCAGACAATGTC 

TGGATTCCTGTTTTTTGGTTACCTGATTTCCATGATCATGATGCTTCTTGTCAACACCCT 

CTTAAGCAGCACCAGAAACAGTGAGTTTGTCTGTACCATTAGGAGTTAGGTACTAATTA 

GTTGGCTAATGCTCAAGTATTTTATACCCACAAGAGAGGTATGTCACTCATCTTACTTC 

CCAGGACATCCACCCTGAGAATAATTTGACAAGCTTAAAAATGGCCTTCATGTGAGTG 

CCAAATTTTGTTTTTCTTCATTTAAATATTTTCTTTGCCTAAATACATGTGAGAGGAGTT 

AAATATAAATGTACAGAGAGGAAAGTTGAGTTCCACCTCTGAAATGAGAATTACTTGA 

CAGTTGGGATACTTTAATCAGAAAAAAAGAACTTATTTGCAGCATTTTATCAACAAATT 

TCATAATTGTGGACAATTGGAGGCATTTATTTTAAAAAACAATTTTATTGGCCTTTTGCT 
AACACAGTAAGCATGTATTTTATAAGGCATTCAATAAATGCACAACGCCCAAAGGAAA 

TAAAATCCTATCTAATCCTACTCTCCACTACACAGAGGTAATCACTATTAGTATTTTGG 

CATATTATTCTCCAGGTGTTTGCTTATGCACTTATAAAATGATTTGAACAAATAAAACT 

AGGAACCTGTATACATGTGTTTCATAACCTGCCTCCTTTGCTTGGCCCTTTATTGAGATA 

AGTTTTCCTGTCAAGAAAGCAGAAACCATCTCATTTCTAACAGCTGTGTTATATTCCAT 

AGTATGCATTACTCAACAAACTGTTGTGCTATTGGATACTTAGGTGGTTTCTTCACTGA 

CAATACTGAATAAACATCTCACCGGAATTC 

Figure 86 

MFLS H.VALCLWLHL ALGVRGAPCE AVRIPMCRHMP TQENAILAIEQ Y 

EELVDWCSAVLRFFLCAMYAPICTLEFLH^ 

SLACDELPVYDRGVCISPEAIVTDLPEDVKWroiTPDMMVQERPLDVDCKRLSPDRCKCKK 
VKPTLATYLSJGSrx"SYVffi^^ 

PHILPHQDVLIMCYEWSRMMLLENCLVEKWRDQLSKRSIQWEERLQEQRRTVQDKKKT 
AGRTSRSNPPKPKGKPPAPKPASPKKN1KTRSAQKRTNPICRV 



Figure 87 

AAGCTTGATATCGAATTCGCGGCCGCGTCGACGGGAGGCGCCAGGATCAGTCGGGGCA 

CCCGCAGCGCAGGCTGCCACCCACCTGGGCGACCTCCGCGGCGGCGGCGGCGGCGGCT 

GGGTAGAGTCAGGGCCGGGGGCGCACGCCGGAACACCTGGGCCGCCGGGCACCGAGC 

GTCGGGGGGCTGCGCGGCGCGACCCTGGAGAGGGCGCAGCCGATGCGGGCGGCGGCG 

GCGGCGGGGGGCGTGCGGACGGCCGCGCTGGCGCTGCTGCTGGGGGCGCTGCACTGG 

GCGCCGGCGCGCTGCGAGGAGTACGACTACTATGGCTGGCAGGCCGAGCCGCTGCACG 

GCCGCTCCTACTCCAAGCCGCCGCAGTGCCTTGACATCCCTGCCGACCTGCCGCTCTGC 

CACACGGTGGGCTACAAGCGCATGCGGCTGCCCAACCTGCTGGAGCACGAGAGCCTGG 

CCGAAGTGAAGCAGCAGGCGAGCAGCTGGCTGCCGCTGCTGGCCAAGCGCTGCCACTC 

GGATACGCAGGTCTTCCTGTGCTCGCTCTTTGCGCCCGTCTGTCTCGACCGGCCCATCT 

ACCCGTGCCGCTCGCTGTGCGAGGCCGTGCGCGCCGGCTGCGCGCCGCTCATGGAGGC 

CTACGGCTTCCCCTGGCCTGAGATGCTGCACTGCCACAAGTTCCCCCTGGACAACGACC 

TCTGCATCGCCGTGCAGTTCGGGCACCTGCCCGCCACCGCGCCTCCAGTGACCAAGATC 

TGCGCCCAGTGTGAGATGGAGCACAGTGCTGACGGCCTCATGGAGCAGATGTGCTCCA 

GTGACTTTGTGGTCAAAATGCGCATCAAGGAGATCAAGATAGAGAATGGGGACCGGA 

AGCTGATTGGAGCCCAGAAAAAGAAGAAGCTGCTCAAGCCGGGCCCCCTGAAGCGCA 

AGGACACCAAGCGGCTGGTGCTGCACATGAAGAATGGCGCGGGCTGCCCCTGCCCACA 

GCTGGACAGCCTGGCGGGCAGCTTCCTGGTCATGGGCCGCAAAGTGGATGGACAGCTG 

CTGCTCATGGCCGTCTACCGCTGGGACAAGAAGAATAAGGAGATGAAGTTTGCAGTCA 

AATTCATGTTCTCCTACCCCTGCTCCCTCTACTACCCTTTCTTCTACGGGGCGGCAGAGC 

CCCACTGAAGGGCACTCCTCCTTGCCCTGCCAGCTGTGCCTTGCTTGCCCTCTGGCCCC 

GCCCCAACTTCCAGGCTGACCCGGCCCTACTGGAGGGTGTTTTCACGAATGTTGTTACT 

GGCACAAGGCCTAAGGGATGGGCACGGAGCCCAGGCTGTCCTTTTTGACCCAGGGGTC 

CTGGGGTCCCTGGGATGTTGGGCTTCCTCTCTCAGGAGCAGGGCTTCTTCATCTGGGTG 

AAGACCTCAGGGTCTCAGAAAGTAGGCAGGGGAGGAGAGGGTAAGGGAAAGGTGGAG 

GGGCTCAGGGCACCCTGAGGCGGAGGTTTCAGAGTAGAAGGTGATGTCAGCTCCAGCT 

CCCCTCTGTCGGTGGTGGGGCCTCACCTTGAAGAGGGAAGTCTCAATATTAGGCTAAG 

CTATTTGGGAAAGTTCTCCCCACCGCCCCTGTACGCGTCATCCTAGCCCCCCTTAGGAA 

AGGAGTTAGGGTCTCAGTGCCTCCAGCCACACCCCCTGCCTTCCCCAGCTTGCCCATTT 

CCCTGCCCCAAGGCCCAGAGCTCCCCCCAGACTGGAGAGCAAGCCCAGCCCAGCCTCG 

GCATAGACCCCCTTCTGGTCCGCCCGTGGCTCGATTCCCGGGATTCATTCCTCAGCCTC 
TGCTTCTCCCTTTTATCCCAATAAGTTATTGCTACTGCTGTGAGGCCATAGGTACTAGAC 
AACCAATACATGCAGGGTTGGGTTTTCTAATTTTTTTAACTTTTTAATTAAATCAAAGGT 
CGACGCGCGGCCGCGGAATTCCTGCAGCCCGGGGGATCCCCGGGTACCGAGCTCGAAT 
TC 

Figure 88 



TEILPALCVLIHHTDVNILVDWWALSYLTDAGInEQIQMVIDSGIVPHLVPLLSHQEVKVQT 

AAI^VGNWTGTDEQTQWLNCDALSHFPALLTHPE^KINEJBAVWLSNITAGNQQQVQ 

AVIDANLWMimLLDKGDFGTQKEAAWAISNLTISGRKX)QVAYLIQQNVIPPFCNLLTVKD 

AQVVQVVLDGLSNILKMAEDEAETIGNL^ 

DDIDEDPSLVPEAIQGGTFGFNSSANVPTEGFQF 



Figure 89 

ATGCATCTCCTCTTATTTCAGCTGCTGGTACTCCTGCCTCTAGGAAAGACCACACGGCA 

CCAGGATGGCCGCCAGAATCAGAGTTCTCTTTCCCCCGTACTCCTGCCAAGGAATCAA 

AGAGAGCTTCCCACAGGCAACCATGAGGAAGCTGAGGAGAAGCCAGATCTGTTTGTCG 

CAGTGCCACACCTTGTAGCCACCAGCCCTGCAGGGGAAGGCCAGAGGCAGAGAGAGA 

AGATGCTGTCCAGATTTGGCAGGTTCTGGAAGAAGCCTGAGAGAGAAATGCATCCATC 

CAGGGACTCAGATAGTGAGCCCTTCCCACCTGGGACCCAGTCCCTCATCCAGCCGATA 

GATGGAATGAAAATGGAGAAATCTCCTCTTCGGGAAGAAGCCAAGAAATTCTGGCACC 

ACTTCATGTTCAGAAAAACTCCGGCTTCTCAGGGGGTCATCTTGCCCATCAAAAGCCAT 

GAAGTACATTGGGAGACCTGCAGGACAGTGCCCTTCAGCCAGACTATAACCCACGAAG 

GCTGTGAAAAAGTAGTTGTTCAGAACAACCTTTGCTTTGGGAAATGCGGGTCTGTTCAT 

TTTCCTGGAGCCGCGCAGCACTCCCATACCTCCTGCTCTCACTGTTTGCCTGCCAAGTTC 

ACCACGATGCACTTGCCACTGAACTGCACTGAACTTTCCTCCGTGATCAAGGTGGTGAT 

GCTGGTGGAGGAGTGCCAGTGCAAGGTGAAGACGGAGCATGAAGATGGACACATCCT 

ACATGCTGGCTCCCAGGATTCCTTTATCCCAGGAGTTTCAGCTTGA 

Figure 90 

MHLLLFQLLVLLPLGKTTRHQDGRQNQSSLSPVLLPRNQRELPTGNHEEAEEKPDLFVAVP 
HLVATSPAGEGQRQREKMLSRFGRFWKKPEREMHPSRDSDSEPFPPGTQSLIQPIDGMKME 
KSPLREEAKKFWHHFMFRKTPASQGVILPIKSHEVHWETCRTVPFSQTITHEGCEKVVVQN 
NLCFGKCGSVHFPGAAQHSHTSCSHCLPAKFTTMHLPLNCTELSSVIKVVMLVEECQCKV 
KTEHEDGHILHAGSQDSFIPGVSA 



Figure 91 

CGGCACGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCT 

CCCTCTTGAGTCCTTCTGAGATGATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTT 

GTCGCGATGGTAGCGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCT 

TGAACTCGGTTCTCAATTCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCT 

GCGGGGCACCCAGGCTCTGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGA 

ATAAGTACCAGACCATTGACAACTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTG 

CGGCACTGATGAGTACTGCGCTAGTCCCACCCGCGGAGGGGACGCAGGCGTGCAAATC 

TGTCTCGCCTGCAGGAAGCGCCGAAAACGCTGCATGCGTCACGCTATGTGCTGCCCCG 
GGAATTACTGCAAAAATGGAATATGTGTGTCTTCTGATCAAAATCATTTCCGAGGAGA 

AATTGAGGAAACCATCACTGAAAGCTTTGGTAATGATCATAGCACCTTGGATGGGTAT 

TCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAAAGGACAAGAAGGTTCTG 

TTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGTCCA 

AGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGG 

CTCTCATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTCTTGCCGGA 

TACAGAAAGATCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGAGACAC 

TAAACCAGCTATCCAAATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACCTT 

TTATGACCTTCATCAACTCAATCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCA 

TTCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAGCTTTGTTTCTTTATGGAACTCC 

CCTGTGATTGCAGTAAATTACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAAT 

GCAATGAAACTTTTAATTATTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTA 

TGTAAATTTTTGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGA 

AGTAAATCATTTCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAG 

AGTCTAGAACGCAAGGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACAT 

GAAAATACTAGCTTATTTTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTT 

AGGCTGTGATAGTTTTTGAAATAAAATTTAACATTTAATATCATGAAATGTTATAAGTA 

GACAT 



Figure 92 

MMALGAAGATRVFVAMVAAALGGHPLLGVSATLNSVLNSNAIKNLPPPLGGAAGHPGSA 
VSAAPGILYPGGNKYQTIDNYQPYPCAEDEECGTDEYCASPTRGGDAGVQICLACRKRRK 
RCMRHAMCCPGNYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMYH 
TKGQEGSVCLRSSDCASGLCCARHFWSKICKPVLKEGQVCTKHRRKGSHGLEIFQRCYCG 
EGLSCRIQKDHHQASNSSRLHTCQRH 



Figure 93 

GCGGGTCTCGCTTGGGTTCCGCTAATTTCTGTCCTGAGGCGTGAGACTGAGTTCATAGG 

GTCCTGGGTCCCCGAACCAGGAAGGGTTGAGGGAACACAATCTGCAAGCCCCCGCGAC 

CCAAGTGAGGGGCCCCGTGTTGGGGTCCTCCCTCCCTTTGCATTCCCACCCCTCCGGGC 

TTTGCGTCTTCCTGGGGACCCCCTCGCCGGGAGATGGCCGCGTTGATGCGGAGCAAGG 

ATTCGTCCTGCTGCCTGCTCCTACTGGCCGCGGTGCTGATGGTGGAGAGCTCACAGATC 

GGCAGTTCGCGGGCCAAACTCAACTCCATCAAGTCCTCTCTGGGCGGGGAGACGCCTG 

GTCAGGCCGCCAATCGATCTGCGGGCATGTACCAAGGACTGGCATTCGGCGGCAGTAA 

GAAGGGCAAAAACCTGGGGCAGGCCTACCCTTGTAGCAGTGATAAGGAGTGTGAAGTT 

GGGAGGTATTGCCACAGTCCCCACCAAGGATCATCGGCCTGCATGGTGTGTCGGAGAA 

AAAAGAAGCGCTGCCACCGAGATGGCATGTGCTGCCCCAGTACCCGCTGCAATAATGG 

CATCTGTATCCCAGTTACTGAAAGCATCTTAACCCCTCACATCCCGGCTCTGGATGGTA 

CTCGGCACAGAGATCGAAACCACGGTCATTACTCAAACCATGACTTGGGATGGCAGAA 

TCTAGGAAGACCACACACTAAGATGTCACATATAAAAGGGCATGAAGGAGACCCCTGC 

CTACGATCATCAGACTGCATTGAAGGGTTTTGCTGTGCTCGTCATTTCTGGACCAAAAT 

CTGCAAACCAGTGCTCCATCAGGGGGAAGTCTGTACCAAACAACGCAAGAAGGGTTCT 

CATGGGCTGGAAATTTTCCAGCGTTGCGACTGTGCGAAGGGCCTGTCTTGCAAAGTATG 

GAAAGATGCCACCTACTCCTCCAAAGCCAGACTCCATGTGTGTCAGAAAATTTGATCA 

CCATTGAGGAACATCATCAATTGCAGACTGTGAAGTTGTGTATTTAATGCATTATAGCA 

TGGTGGAAAATAAGGTTCAGATGCAGAAGAATGGCTAAAATAAGAAACGTGATAAGA 

ATATAGATGATCAC 






Figure 94 
MAALMRSKDSSCCLLLLA^ 

LAFGGSKKGK^GQAYPCSSDKECEVGRYCHSPHQGSSACMVCRRKKKRCHRDGMCCPS 
TRCNNGICIPVTESILTPHIPALDGTRHRDR]^^ 

DPCDRSSDCffiGFCCARHFWTEJCKPVLHQGEVCTKQRKKGSHGLEIFQRCDCAKGLSCKV 
WKDATYSSKARLHVCQKI 

Figure 95 

ctatcacaatgagaccaacacagacacgaaggttggaaataataccatccatgtgcac 

cgagaaattcacaagataaccaacaaccagactggacaaatggtcttttcagagacag 

ttatcacatctgtgggagacgaagaaggcagaaggagccacgagtgcatcatcgacg 

aggactgtgggcccagcatgtactgccagtttgccagcttccagtacacctgccagcc 

atgccggggccagaggatgctctgcacccgggacagtgagtgctgtggagaccagctg 

tgtgtctggggtcactgcaccaaaatggccaccaggggcagcaatgggaccatctgtg 

acaaccagagggactgccagccggggctgtgctgtgccttccagagaggcctgctgtt 

ccctgtgtgcacacccctgcccgtggagggcgagctttgccatgaccccgccagccgg 

cttctggacctcatcacctgggagctagagcctgatggagccttggaccgatgcccttg 

tgccagtggcctcctctgccagccccacagccacagcctggtgtatgtgtgcaagccg 

accttcgtggggagccgtgaccaagatggggagatcctgctgcccagagaggtccccg 

atgagtatgaagttggcagcttcatggaggaggtgcgccaggagctggaggacctgga 

gaggagcctgactgaagagatggcgctgggggagcctgcggctgccgccgctgcactg 

ctgggaggggaagagatttagatctggaccaggctgtgggtagatgtgcaatagaaat 

agctaatttatttccccaggtgtgtgctttaggcgtgggctgaccaggcttcttcctac 

atcttcttcccagtaagtttcccctctggcttgacagcatgaggtgttgtgcatttgttc 

agctcccccaggctgttctccaggcttcacagtctggtgcttgggagagtcaggcagg 

gttaaactgcaggagcagtttgccacccctgtccagattattggctgctttgcctctac 

cagttggcagacagccgtttgttctacatggctttgataattgtttgaggggaggagat 

ggaaacaatgtggagtctccctctgattggttttggggaaatgtggagaagagtgccc 

tgctttgcaaacatcaacctggcaaaaatgcaacaaatgaattttccacgcaGttcttt 

ccatgggcataggtaagctgtgccttcagctgttgcagatgaaatgttctgttcaccct 

gcattacatgtgtttattcatccagcagtgttgctcagctcctacctctgtgccagggc 

agcattttcatatccaagatcaattccctctctcagcacagcctggggagggggtcatt 

gttctcctcgtccatcagggatctcagaggnctcagagactgcaagctgcttgcccaa 

gtcacacagctagtgaagaccagagcagtttcatctggttgtgactctaagctcagtgc 

tctctccactaccccacaccagccttggtgccaccaaaagtgctccccaaaaggaagg 

agaatgggatttttcttttgaggcatgcacatctggaattaaggtcaaactaattctca 

catccctctaaaagtaaactactgttaggaacagcagtgttctcacagtgtggggcag 

ccgtccttctaatgaagacaatgatattgacactgtccctctttggcagttgcattagt 

aactttgaaaggtatatgactgagcgtagcatacaggttaacctgcagaaacagtact 

taggtaattgtagggcgaggattataaatgaaatttgcaaaatcacttagcagcaact 

gaagacaattatcaaccacgtggagaaaatcaaaccgagcagggctgtgtgaaacat 

ggttgtaatatgcgactgcgaacactgaactctacgccactccacaaatgatgttttca 

ggtgtcatggactgttgccaccatgtattcatccagagttcttaaagtttaaagttgca 

catgattgtataagcatgctttctttgagttttaaattatgtataaacataagttgcatt 

tagaaatcaagcataaatcac 
Figure 96 

MQI^GATLLCLLLAAAWTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDT 

QHKLRSAVEEMEAEEAAAKASSEVNLANLPPSYHNETNTDTKVGNNTIHVHREIHKITNNQ 

TGQMVFSETVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSE 

CCGDQLCVWGHCTKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHD 

PASRLLDLITWELEPDGALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVP 

DEYEVGSFMEEVRQELEDLERSLTEEMALGEPAAAAAALLGGEEI 



Figure 97 

AGACGACGTGCTGAGCTGCCAGCTTAGTGGAAGCTCTGCTCTGGGTGGAGAGCAGCCT 

CGCTTTGGTGACGCACAGTGCTGGGACCCTCCAGGAGCCCCGGGATTGAAGGATGGTG 

GCGGCCGTCCTGCTGGGGCTGAGCTGGCTCTGCTCTCCCCTGGGAGCTCTGGTCCTGGA 

CTTCAACAACATCAGGAGCTCTGCTGACCTGCATGGGGCCCGGAAGGGCTCACAGTGC 

CTGTCTGACACGGACTGCAATACCAGAAAGTTCTGCCTCCAGCCCCGCGATGAGAAGC 

CGTTCTGTGCTACATGTCGTGGGTTGCGGAGGAGGTGCCAGCGAGATGCCATGTGCTG 

CCCTGGGACACTCTGTGTGAACGATGTTTGTACTACGATGGAAGATGCAACCCCAATAT 

TAGAAAGGCAGCTTGATGAGCAAGATGGCACACATGCAGAAGGAACAACTGGGCACC 

CAGTCCAGGAAAACCAACCCAAAAGGAAGCCAAGTATTAAGAAATCACAAGGCAGGA 

AGGGACAAGAGGGAGAAAGTTGTCTGAGAACTTTTGACTGTGGCCCTGGACTTTGCTG 

TGCTCGTCATTTTTGGACGAAAATTTGTAAGCCAGTCCTTTTGGAGGGACAGGTCTGCT 

CCAGAAGAGGGCATAAAGACACTGCTCAAGCTCCAGAAATCTTCCAGCGTTGCGACTG 

TGGCCCTGGACTACTGTGTCGAAGCCAATTGACCAGCAATCGGCAGCATGCTCGATTA 

AGAGTATGCCAAAAAATAGAAAAGCTATAAATATTTCAAAATAAAGAAGAATCCACAT 

TGC 

Figure 98 

MVAAVLLGLSWLCSPLGALVLDFNNIRSSADLHGARKGSQCLSDTDCNTRKFCLQPRDEK 
PFCATCRGLRRRCQRDAMCCPGTLCVNDVCTTMEDATPILERQLDEQDGTHAEGTTGHPV 
QENQPKRKRSKKSQGRKGQEGESCLRTFDCGPGLCCARHFWTKICKPVLLEGQVCSRRGH 
KDTAQAPEIFQRCDCGPGLLCRSQLTSNRQHARLRVCQKIEKL 



Figure 99 

AGGCAGAATACTTCTATGAATTCCTGTCCTTGCGCTCCCTGGATAAAGGCATCATGGCA 

GATCCAACCGTCAATGTCCCTCTGCTGGGAACAGTGCCTCACAAGGCATCAGTTGTTCA 

AGTTGGTTTCCCATGTCTTGGAAAACAGGATGGGGTGGCAGCATTTGAAGTGGATGTG 

ATTGTTATGAATTCTGAAGGCAACACCATTCTCCAAACACCTCAAAATGCTATCTTCTT 

TAAAACATGTCAACAAGCTGAGTGCCCAGGCGGGTGCCGAAATGGAGGCTTTTGTAAT 

GAAAGACGCATCTGCGAGTGTCCTGATGGGTTCCACGGACCTCACTGTGAGAAAGCCC 

TTTGTACCCCACGATGTATGAATGGTGGACTTTGTGTGACTCCTGGTTTCTGCATCTGCC 

CACCTGGATTCTATGGAGTGAACTGTGACAAAGCAAACTGCTCAACCACCTGCTTTAAT 

GGAGGGACCTGTTTCTACCCTGGAAAATGTATTTGCCCTCCAGGACTAGAGGGAGAGC 
AGTGTGAAATCAGCAAATGCCCACAACCCTGTCGAAATGGAGGTAAATGCATTGGTAA

AAGCAAATGTAAGTGTTCCAAAGGTTACCAGGGAGACCTCTGTTCAAAGCCTGTCTGC 

GAGCCTGGCTGTGGTGCACATGGAACCTGCCATGAACCCAACAAATGCCAATGTCAAG 

AAGGTTGGCATGGAAGACACTGCAATAAAAGGTACGAAGCCAGCCTCATACATGCCCT 

GAGCGCAGCAGCGCCCAGCTCAGGCAGCACACGCCTTCACTTAAAAAGGCCGAGGAG 

CGGCGGCATCCACCTGAATCCAATTACATCTGGTGAACTCCGACATCTGAAACGTTTTA 

AGTTACACCAAGTTCATAGCCTTTGTTAACCTTTCATGTGTTGAATGTTCAAATAATGTT 

CATTACACTTAAGAATACTGGCCTGAATTTTATTAGCTTCATTATAAATCACTGAGCTG 

ATATTTACTCTTCCTTTTAAGTTTTCTAAGTACGTCTGTAGCATGATGGTATAGATTTTC 

TTGTTTCAGTGCTTTGGGACAGATTTTATATTATGTCAATTGATCAGGTTAAAATTTTCA 

GTGTGTAGTTGGCAGATATTTTCAAAATTACAATGCATTTATGGTGTCTGGGGGCAGGG 

GAACATCAGAAAGGTTAAATTGGGCAAAAATGCGTAAGTCACAAGAATTTGGATGGTG 

CAGTTAATGTTGAAGTTACAGCATTTCAGATTTTATTGTCAGATATTTAGATGTTTGTTA 

CATTTTTAAAAATTGCTCTTAATTTTTAAACTCTCAATACAATATATTTTGACCTTACCA 

TTATTCCAGAGATTCAGTATTAAAAAAAAAAAAATTACACTGTGGTAGTGGCATTTAA 

ACAATATAATATATTCTAAACACAATGAAATAGGGAATATAATGTATGAACTTTTTGCA 

TTGGCTTGAAGCAATATAATATATTGTAAACAAAACACAGCTCTTACCTAATAAACATT 

TTATACTGTTTGTATGTATAAAATAAAGGTGCTGCTTTAGTTTTC 



Figure 100 

MARRSAFPAAALWLWSILLCLLALRAEAGPPQEESLYLWIDAHQARVLIGFEEDILIVSEGK 

MAPFTHDFRKAQQRMPAIPVNmSMNFTWQAAGQAEYFYEFLSLRSLDKG 

LLGTWHKASWQVGFPCLGKQDGVAAFEVDVIVMNSEGNTILQTPQNAIFFKTCLQAECP

GGCRNGGFCNEREICECPDGFHGPHCEKALCTPRCMNGGLCVTPGFCICPPGFYGVNCDK 

ANCSTTCFNGGTCFYPGKCICPPGLEGEQCEISKCPQPCRNGGKCIGKSKCKCSKGYQGDL 

CSKPVCEPGCGAHGTCHEPNKCQCQEGWHGRHCNKRYEASLIHALR 

PAGAQLRQHTPSLKEAEERRDPPESNYIW 

Figure 101 

ATGGGCATCGGGCGCAGCGAGGGGGGCCGCCGCGGGGCAGCCCTGGGCGTGCTGCTG 

GCGCTGGGCGCGGCGCTTCTGGCCGTGGGCTCGGCCAGCGAGTACGACTACGTGAGCT 

TCCAGTCGGACATCGGCCCGTACCAGAGCGGGCGCTTCTACACCAAGCCACCTCAGTG 

CGTGGACATCCCCGCGGACCTGCGGCTGTGCCACAACGTGGGCTACAAGAAGATGGTG 

CTGCCCAACCTGCTGGAGCACGAGACCATGGCGGAGGTGAAGCAGCAGGCCAGCAGC 

TGGGTGCCCCTGCTCAACAAGAACTGCCACGCCGGCACCCAGGTCTTCCTCTGCTCGCT 

CTTCGCGCCCGTCTGCCTGGACCGGCCCATCTACCCGTGTCGCTGGCTCTGCGAGGCCG 

TGCGCGACTCGTGCGAGCCGGTCATGCAGTTCTTCGGCTTCTACTGGCCCGAGATGCTT 

AAGTGTGACAAGTTCCCCGAGGGGGACGTCTGCATCGCCATGACGCCGCCCAATGCCA 

CCGAAGCCTCCAAGCCCCAAGGCACAACGGTGTGTCCTCCCTGTGACAACGAGTTGAA 

ATCTGAGGCCATCATTGAACATCTCTGTGCCAGCGAGTTTGGGCTGAGTTTAAAGATGA 

TTGTGGGTAGCTCCCATAACTCATGCTGCACGCTGGGTCCTTCTCATCCCAACTCCTCA 

AAGCGGCAGGAGCAGGAACTGGGGACTCCTGAGAGAAGGCTTGGATATGGCCTTTTAT 

TACACTTCATCCAAGGAAATCTGCCCCCACCCTGTGCCCAGGCCCGATCACGCATGAG 

GCTAAAGACGGAGGCCACTCCGCTGGCTCTGGGTAGATCTGCCCCTGGACTGTTTGCC 

GACTGCCCGGAGCGCCCTCTGCCGGTCTGCAGCTTCCCACACCACACGGAAGAAGTGG 

GGAAACTGAGGATACATTCTTTCCTCCTCCAGGTAAAGGGATTCTCAATGAAGGGCTTG 

TGTGCACCTTCCACACTTAGATACCTCTACTACCTGAAAACCAGCATGCAGCATGTACA 

TCAAGAGTACCAGGCACATAGTGCTCAAGTCTGGGCTAATATGCCACCTGCAGAGAGA 

TGTAAAGATGAAGAAGACAAAGCCATGTTTTCAAAGTGA 
Figure 102

MGIGRSEGGRRGAALGVLLALGAALLAVGSASEYDYVSFQSDIGPYQSGRFYTKPPQCVDI 

PADLRLCHNVGYKKMVLPNLLEHETMAEVKQQASSWVPLLNKNCHAGTQVFLCSLFAPV

CLDRPIYPCRWLCEAVRDSCEPVMQFFGFYWPEMLKCDKFPEGDVCIAMTPPNATEASKP

QGTTVCPPCDNELKSEAIIEHLCASEFGLSLKMIVGSSHNSCCTLGPSHPNSSKRQEQELGTP

ERREGYGLLLHFIQGNLPPPCAQARSRMRLKTEATPLALGRSAPGLFADCPERPLPVCSFPH

HTEEVGKLRIHSFLLQVKGFSMKGLCAPSTLRYLYYLKTSMQHVHQEYQAHSAQVWANM

PPAERCKDEEDKAMFSK 



Figure 103 

ggcgggttcgcgccccgaaggctgagagctggcgctgctcgtgccctgtgtgccagac 

ggcggagctccgcggccggaccccgcggccccgctttgctgccgactggagtttgggg 

gaagaaactctcctgcgccccagaagatttcttcctcggcgaagggacagcgaaagat 

gagggtggcaggaagagaaggcgctttctgtctgccggggtcgcagcgcgagagggc 

agtgccatgttcctctccatcctagtggcgctgtgcctgtggctgcacctggcgctggg 

cgtgcgcggcgcgccctgcgaggcggtgcgcatgcctatgtgccggcacatgccctgg 

aacatcacgcggatgcccaaccacctgcaccacagcacgcaggagaacgccatcctgg 

ccatcgagcagtacgaggagctggtggacgtgaactgcagcgccgtgctgcgcttctt 

cttctgtgccatgtacgcgcccatttgcaccctggagttcctgcacgaccctatcaagc 

cgtgcaagtcggtgtgccaacgcgcgcgcgacgactgcgagcccctcatgaagatgta 

caaccacagctggcccgaaagcctggcctgcgacgagctgcctgtctatgaccgtggc 

gtgtgcatttcgcctgaagccatcgtcacggacctcccggaggatgttagtggataga 

catcacaccagacatgatggtacaggaaaggcctcttgatgttgactgtaaacgccta 

agccccgatcggtgcaagtgtaaaaaggtgaagccaactttggcaacgtatctcagca 

aaaactacagctatgttattcatgccaaaataaaagctgtgcagaggagtggctgcaa 

tgaggtcacaacggtggtggatgtaaaagagatcttcaagtcctcatcacccatccctc 

gaactcaagtcccgctcattacaaattcttcttgccagtgtccacacatcctgccccat 

caagatgttctcatcatgtgttacgagtggcgttcaaggatgatgcttcttgaaaattg 

cttagttgaaaaatggagagatcagcttagtaaaagatccatacagtgggaagagag 

gctgcaggaacagcggagaacagttcaggacaagaagaaaacagccgggcgcaccag 

tcgtagtaatccccccaaaccaaagggaaagcctcctgctcccaaaccagccagtccc 

aagaagaacattaaaactaggagtgcccagaagagaacaaacccgaaaagagtgtga 

gctaactagtttccaaagcggagacttccgacttccttacaggatgaggctgggcattg 

cctgggacagcctatgtaaggccatgtgccccttgccctaacaactcactgcagtgctc

ttcatagacacatcttgcagcatttttcttaaggctatgcttcagtttttctttgtaagc 

catcacaagccatagtggtaggtttgccctttggtacagaaggtgagttaaagctggt 

ggaaaaggcttattgcattgcattcagagtaacctgtgtgcatactctagaagagtag

ggaaaataatgcttgttacaattcgacctaatatgtgcattgtaaaataaatgccatat 

ttcaaacaaaacacgtaatttttttacagtatgttttattaccttttgatatctgttgtt 

gcaatgttagtgatgttttaaaatgtgatgaaaatataatgtttttaagaaggaacagt 

agtggaatgaatgttaaaagatctttatgtgtttatggtctgcagaaggatttttgtga 

tgaaaggggattttttgaaaaattagagaagtagcatatggaaaattataatgtgttt 

ttttaccaatgacttcagtttctgtttttagctagaaacttaaaaacaaaaataataat 

aaagaaaaataaataaaaaggagaggcagacaatgtctggattcctgttttttggtta 
CCTGATTTCCATGATCATGATGCTTCTTGTCAACACCCTCTTAAGCAGCACCAGAAACA

GTGAGTTTGTCTGTACCATTAGGAGTTAGGTACTAATTAGTTGGCTAATGCTCAAGT 

ATTTTATACCCACAAGAGAGGTATGTCACTCATCTTACTTCCCAGGACATCCACCCTGA 

GAATAATTTGACAAGCTTAAAAATGGCCTTCATGTGAGTGCCAAATTTTGTTTTTCTTC 

ATTTAAATATTTTCTTTGCCTAAATACATGTGAGAGGAGTTAAATATAAATGTACAGAG 

AGGAAAGTTGAGTTCCACCTCTGAAATGAGAATTACTTGACAGTTGGGATACTTTAATC 

AGAAAAAAAGAACTTATTTGCAGCATTTTATCAACAAATTTCATAATTGTGGACAATTG 

GAGGCATTTATTTTAAAAAACAATTTTATTGGCCTTTTGCTAACACAGTAAGCATGTAT 

TTTATAAGGCATTCAATAAATGCACAACGCCCAAAGGAAATAAAATCCTATCTAATCC 

TACTCTCCACTACACAGAGGTAATCACTATTAGTATTTTGGCATATTATTCTCCAGGTGT 

TTGCTTATGCACTTATAAAATGATTTGAACAAATAAAACTAGGAACCTGTATACATGTG 

TTTCATAACCTGCCTCCTTTGCTTGGCCCTTTATTGAGATAAGTTTTCCTGTCAAGAAAG 

CAGAAACCATCTCATTTCTAACAGCTGTGTTATATTCCATAGTATGCATTACTCAACAA 

ACTGTTGTGCTATTGGATACTTAGGTGGTTTCTTCACTGACAATACTGAATAAACATCT 

CACCGGAATTC 



Figure 104 

MFLSILVALCLWLHLALGVRGAPCEAVRIPM 

EELVDVNCSAVLRFFFCAMYAPICTLEFLHDPIKPCKSVCQRARDDCEPLMKMYNHSWPES

LACDELPVYDRGVCISPEAIVTDLPEDVKWIDITPDMMVQERPLDVDCKRLSPDRCKCKKV 

KPTLATYLSKNYSYVIHAEIKAVQRSGCNEVTWVDVKEIFKSSSPIPRTQVPLITNSSCQCP

HILPHQDVLIMCYEWRSRMMLLENCLVEKWRDQLSKRSIQWEERLQEQRRTVQDKKKTA

GRTSRSNPPKPKGEPPAPKPASPKKNIKTRSAQKRTNPKRV 
Figure I